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Preface 


We are happy to honor three well-known number theorists, Henry 
B. Mann, Arnold E. Ross,and Olga Taussky-Todd. We do this at a time 
when they can look back over many years of honest toil and productive 
work. 

We, their colleagues, pupils, collaborators, and friends find it fitting to 
dedicate to them the fruits of our work so as to pass on to researchers 
coming after us the spirit of patient toil in the service of the queenly science 
which our three honorees implanted in us. 

We would like to thank the publisher for encouraging the publication 
of this special volume. We are also grateful for the overwhelming response 
by the authors upon this happy occasion. 

As a study of the table of contents of this book will suggest to the 
reader, there is an enormous variety and depth to the research efforts of the 
modern number theorist and algebraist. As a first guide to the reader who 
is familiar with the traditional classification of the fields under consideration, 
we would like to group the contents along the following lines. 

Elementary number theory is one of the most ancient sources of number 
theoretical and algebraic speculation which only now seems to be able to 
find its true depth. See for example the contribution of “Erd6s” type by 
P. Erdos and E. G. Strauss. Richard K. Guy’s contribution has a more 
experimental flavor. Emma Lehmer follows in the steps of C. F. Gauss 
(1777-1855). An application of Lie algebra is made by J. W. B. Hughes. 

Numbers as objects of statistical analytic number theory investigations 
are dealt with by K. Mahler, by R. L. Graham, H. S. Witsenhausen, and 
J. H. Spencer; also by Wolfgang M. Schmidt as well as by the late 
S. Knapowski and P. Turan. 

An application of the theory of modular forms is given by Bram van Asch. 

The combinatorial aspects of number theory and their connection with 
group theory are dealt with in the contribution by John E. Olson and 
Edward T. White; also by Sundar Lal, R. L. McFarland, and R. W. K. Odoni 


XV 
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in response to a challenging problem of H. Mann on difference sets. The 
combinatorial properties of finite fields are dealt with by Ὁ. K. Ray-Chaudhuri 
and N. M. Singhi. An application of such ideas in finite projective geometry 
was made by Jill C. D. S. Yaqub. 

The algebraic theory of quadratic forms and their applications is expounded 
by Warren Wolfe. A novel more abstract treatment of the theory is given 
by Manfred Knebusch. 

The classical arithmetical theory of quadratic forms is dealt with in the 
articles by A. G. Earnest and J. 5. Hsia, Robert Gold and Paul Ponomarev, 
and by Marshall Hall, Jr. 

The algebra of finite extensions was enriched by the late Bohuslav Divis in 
reply to a challenging question of A. Schinzl. 

The theory of algebraic number fields and their arithmetics is richly 
represented by the contributions of D. S. Dummit and H. Kisilevsky, 
A. Frohlich, Dennis A. Garbanati, Basil Gordon and Murray Schacher, 
Gary Queen, Ross Schipper, William Yslas Vélez, by Sunder Lal, R. L. 
McFarland, and R. W. K. Odoni, and by M. Pohst. 

The principles of noncommutative arithmetics are dealt with by Wilhelm 
Plesken and by Hans Zassenhaus. 

The geometry of numbers, an ally of number theoretical research since 
Gauss and Dirichlet, which was developed as a new mathematical science 
by Hermite and Minkowski, is represented by the article of R. P. Bambah 
and A. C. Woods. 

The algebraic geometry of curves providing a rich field of investigation 
in the spirit of number theory is dealt with in the contribution of Manohar 
L. Madan and Sat Pal. An application in the same spirit on the theory of 
formal power series is given by Charles F. Osgood. 


Biographical Sketches 


Henry B. Mann 


In 1975 Henry B. Mann celebrated his seventieth birthday. A mathemati- 
cian of international fame, Mann, in a career of more than forty years, has 
made significant contributions to algebra, number theory, statistics, and 
combinatorics. | 

Henry Mann was born October 27, 1905, in Vienna. He received his 
Ph.D. degree in mathematics in 1935 from the University of Vienna where, 
as a student of Philipp Furtwangler, he wrote his dissertation in algebraic 
number theory. After a year of teaching school in Vienna and a couple of 
years spent in research and tutoring, he emigrated in 1938 to the United 
States. 

In New York he earned his living for several years primarily by tutoring. 
He had by then developed an interest in mathematical statistics, particularly 
in the analysis of variance, and in the problem of designing experiments with 
a view to their statistical analysis. He later contributed to this subject in a 
number of research papers and in his book (1949) “Analysis and Design of 
Experiments.” 

One of Mann’s most remarkable achievements was his discovery in 1941 
of a proof of a celebrated conjecture of Schnirelmann and Landau in addi- 
tive number theory. This conjecture had its origin in the work of L. Schnirel- 
mann in the early 1930s. Schnirelmann had considered a density 6(A) for a 
set of positive integers A, which he defined by 


6(A) = inf{A(n)/n|n = 1, 2, ...}, 
where A(n) is the number of positive integers <n in the set A. He showed 
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(most easily) that if the sum A + B of two sets of positive integers is formed 
by A+ B={a, b,a+ blae A, be B}, then the density satisfies the rules: 


(1) 65(A + B) > d(A) + δ(Β) — 5(A)6(B). 
(2) A+ Bcontains all positive integers if 6(A) + 6(B) = 1. 


From these two rules he obtained (readily) the result that any set having 
positive density is a basis for the integers (that is, if 6(A) > 0, then the sum of 
A with itself sufficiently many times contains all positive integers). As an 
application of these ideas, Schnirelmann proved (for the first time) the exist- 
ence of a value k such that every integer greater than 1 is the sum of at most Κα 
primes. This he did by showing that P + P—P 1s the set of primes together 
with 1—has positive density, hence is a basis for the integers. 

Out of further study of these ideas by Schnirelmann himself and by E. 
Landau, there arose the conjecture that (1) and (2) may be replaced by the 
much stronger statement: Either A + B contains all positive integers or 


6(A + B) > δ(Α) + δ(Β.. 


This conjecture, appealing in its apparent simplicity, soon attracted wide 
attention. Many distinguished mathematicians attempted to find a proof; 
indeed, partial results were obtained over the next decade by E. Landau, A. 
Khintchine, A. Besicovitch, I. Schur, and A. Brauer. 

It was this conjecture that Mann succeeded in proving in 1941. His 
interest in the problem had been aroused through the lectures of A. Brauer 
at New York University. Actually, he proved the still sharper statement: If 
C = A+ B, then either C(n) =n or 
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For his proof he was awarded the Cole Prize in Number Theory by the 
American Mathematical Society in 1946. The technique that Mann in- 
troduced in his proof, and its various modifications, have led to further 
important results in additive number theory and have also proved useful in 
the more general setting of additive problems in groups. 

In 1942 Mann was the recipient of a Carnegie Fellowship for the study of 
statistics at Columbia University. At Columbia he had the opportunity of 
working with Abraham Wald in the department of economics, which at that 
time was headed by Harold Hotelling. He taught for a year (1943-1944) in 
the Army Specialized Training Program at Bard College; he spent a year 
(1944-1945) as research associate at Ohio State University, and six months 
as research associate at Brown University. In 1946 he returned to Ohio State 
to join the mathematics faculty where, as associate professor (1946-1948) 
and full professor (1948-1964), he was actively engaged in teaching and 
research for many years. He held professorships at the University of Wiscon- 
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sin Mathematics Research Center from 1964 to 1971, and at the University 
of Arizona from 1971 until his retirement in 1975. 

Mann’s research interests in algebra and combinatorics cover a wide 
range. He has a special fondness, though, for algebraic number theory and 
Galois theory, and has imparted his enthusiasm for these subjects to many 
students over the years. Besides his dozen or so papers that contribute 
directly to these subjects, several of his papers on difference sets and coding 
theory contain beautiful applications of theorems on algebraic numbers and 
Galois theory. 

Neither Henry Mann’s research nor, for that matter, his teaching have 
ended with his retirement; he spent the past spring term as visiting professor 
at Oregon State University. He and his wife Anne, who this year observed 
their 41st wedding anniversary, plan to continue living in Tucson, Arizona. 

His friends, colleagues, and students wish him many more years of good 
health and enjoyment of mathematics. 


JOHN OLSON 
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Arnold E. Ross 


The following is a quotation from an address in honor of 
Arnold E. Ross 


at the Meeting of the Ohio Section of the Mathematical Association of America 
at Youngstown, Ohio, May 7, 1976 


Tonight we are here to honor Arnold E. Ross, the man, the mathemati- 
cian, and the leader of men. 

We are very glad to have Arnold and Bee with us. She has been his 
companion in everyday’s perplexities and his pillar of strength. 

Born in Chicago in 1906 to an immigrant couple from southern Russia, 
Arnold was taken by his mother back to the old country in his early youth 
for a visit with his grandparents. The first World War and then the revolu- 
tion cut them off from the USA for many years. He received most of his 
precollege education in Odessa, a cultured city on the Black Sea which 
provided the early environment of many famous scientists, musicians, and 
politicians. 

Early influences are most formative and of the greatest impact on the 
intellectual development of a person and on the direction of his life work. 

Arnold must have been a precocious youngster since he was able to enter 
the University under a special arrangement at less than sixteen years of age. 
It was Professor Shatunovsky who was deeply interested in very gifted 
youngsters and who was a friend of Arnold Ross’ physician uncle who 


XXVil 


XXVili ARNOLD E. ROSS 


encouraged this early start and helped with the needed arrangements. Be- 
sides Arnold, a cousin of his as well as one of his friends and another 
future mathematician, Felix Gantmacher, took part in the program, all of 
about the same boyish age. S. O. Shatunovsky (1859-1929), a distinguished 
Russian mathematician and educator, was one of the leaders of science and 
mathematics education in postrevolutionary Russia. Another prominent 
leader of science education in the 1930s in Russia was the energetic 
physicist Peter Kapitza (1894b.). 

A small number of farsighted distinguished Russian scientists and math- 
ematicians building on a centuries old excellent academic tradition 
succeeded in the twenties and thirties in designing and implementing a 
broadly based, competitive system of science and mathematics education in 
the elementary and secondary schools of the USSR, succeeding thereby in 
creating many more opportunities for the talented children of all sectors of 
the people than there had been available in czarist Russia. 

In this country, Arnold’s summer program for gifted high school stu- 
dents and a few other similarly oriented programs amply demonstrated in 
the sixties and seventies that a sizeable number of boys and girls at an age of 
13 or slightly older are at the peak of their learning power, full of curiosity 
and eagerness to explore new avenues of thought, ready to be lead by skillful 
teachers much further than is presently possible at the high schools of this 
country. 

Hungary and Belgium also have long traditions of talent search and 
nurture from which Arnold and his helpers have profited. : 

I can testify about a similar experience in Germany, where my parents 
sent me to a new school founded in Hamburg, Germany, a few years after 
the first World War, by a group of teachers who believed in “unlimited 
horizons” of learning for boys and girls between 10 and 18. Though the 
science and math training at the Lichtwarck Schule was no different from 
the traditional fare of other Hamburgian schools, in all other subjects— 
languages, music, art and handcraft, history and government, art apprecia- 
tion, and philosophy—we received far more stimulation and we were given 
a good many more opportunities of independent studies than were available 
to other Hamburgian high school students. 

The blessings of early nurture of intellectual curiosity by the careful 
attention of excellent teachers make the recipients desirous of conferring 
similar blessings on the next generation. 

When Arnold came to Chicago in 1922 he was ready to study; but he first 
went to an engineering school and only in 1925 he enrolled in the study of 
higher mathematics at the University of Chicago. 

I know many other colleagues whose love of mathematics was kindled 
while they were studying engineering. Raoul Bott started out as an engineer- 
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ing student and so did Hans Schwerdtfeger. At the University of Chicago 
Arnold fell under the spell of E. H. Moore (1862-1932), the well-known 
mathematician, educator, and organizer. It was of course E. H. Moore’s 
problem-oriented method of teaching by self-discovery that left its perman- 
ent imprint on his mind as it is bound to influence everybody who has ever 
come under its spell.t On the basis of the earlier experience at the University 
of Odessa the new experience served as a catalyst for what became visible 
many years later. Shatunovsky and Moore both pioneered the directed limit 
generalization of the limit concept of analysis; both were very influential 
educational organizers and leaders in their great countries. 

Arnold E. Ross graduated Ph.D. under L. E. Dickson in 1931. Sub- 
sequently he published a number of deep researches on the arithmetical 
properties of ternary quadratic forms. The big unsolved question remains 
whether a real ternary indefinite quadratic form that is bounded away from 
zero must be proportionate to a rational form. 

Two postdoctoral years were spent at the California Institute of Techno- 
logy, 1931-1933, where E. T. Bell and E. Bateman were active at that time. 
The next two years saw Arnold teaching mathematics at the Chicago City 
College Cooperative College, an academic self-help effort of the depression 
years which must have presented its organizers with quite a variety of non- 
trivial teaching problems. 

Arnold spent his early academic career at the Department of Mathema- 
tics of the University of St. Louis at St. Louis, Missouri 1935-1946. In those 
years there was ample time to discuss all the problems of a small, but lively 
department of mathematics as well as the problems of higher mathematics 
and science education in a new country with colleagues like S. Warshawsky 
and G. Szeg6 and visitors like E. Artin, all three of them recent immigrants 
from Europe. 

Following the European tradition, Arnold always took a strong interest 
in the sciences, in particular in modern physics. It was because of his friend- 
ship with C. Mullins and J. Guth that in 1946 they suggested to Father Kavan- 
augh, the president of the University of Notre Dame, Indiana, to ask Arnold 
E. Ross to succeed Karl Menger as the chairman of the Mathematics 
Department. One of the first things the new chairman did, as Arnold once 
told me, was to change the alphabetic listing of the freshman class lists to 
grouping by experience and readiness to study. He went in person to the 
President to achieve this change by force of his conviction. I think this little 
anecdote is characteristic of the length to which Professor Ross is likely to go 
for his students. 


+ See F. Browder, “ Youthful enjoyment of rigorous thinking,” University of Chicago Maga- 
zine, Winter 1975; and A. E. Ross, A preliminary report on a talent search program at the 
University of Chicago, November 1975. 
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One of his first Notre Dame students was Charles Misner, now a famous 
theoretical physicist and cosmologist who profited much from the ambitious 
undergraduate honors program developed jointly by A. E. Ross and C. 
Mullins. 

During his 28 years as Chairman of the Department of Mathematics of 
the Notre Dame University (1946-1963) and The Ohio State University 
(1963-1974) Arnold E. Ross built up from inconspicuous beginnings a cor- 
nucopia of original training programs reflecting the essence of his own early 
experiences at Odessa and Chicago that talent is everywhere, it only needs to 
be nurtured lovingly to come to fruition. 

Let me describe these programs in their fully developed form. 


1. Summer program for gifted high school students (1964-1974 OSU, 
1975 University of Chicago; ongoing; some NSF support; idea of program 
spreading to India and Australia, generating strong interest in Germany). 

“Is community affluence a necessary prerequisite for early excellence? 
No, also small urban or rural communities, even the heart of the city com- 
munities harbours early excellence, it only must be developed” [13]. 

From a national reservoir of applications between 40 and 65 gifted boys 
and girls from the ages of 14 to 17 are selected for intensive on campus 
training by the best available teachers over a period of eight weeks for one, 
two, or three summers. Supervised housing is provided in campus dormi- 
tories. Former students of the program and other students act as counsellors 
and helpers for the young visitors who are under their care and supervision 
during the period of the training program. 

I have often taught in these summer programs. The students I had there 
were my best ever. I could go as far with them (in essence) during a period of 
eight weeks as I could go with a normal graduate student class in the course of 
three quarters. I remember still vividly Mary, one of the gifted high school 
girls, sitting up on a tall tree in the middle of the Oval, deeply engaged in 
the study of a math book while everybody else basked in the warm summer 
sunshine. 

The talented student brings as much to the program as he carries away! 

Professor A. E. Ross discovered in a life long experience of discovery 
with his youngsters that a course in elementary number theory is particularly 
suitable to test and develop the talents of a young gifted person. Here follows a 
quotation from [14] regarding the objectives of the number theory training: 
“It is our aim to develop attitudes as well as skills, to introduce the students 
to an intelligent use of algorithms as well as to the mastery of the underlying 
theory, and to make use of number theory not only as an important stepping 
stone to the study of both analysis and algebra but also as an environment in 
which we can exhibit a whole gamut of vital dilemmas which confront any 
scientist in any field at one time or another. 
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“From the very beginning the student is given an opportunity to develop 
his powers of observation, to experiment, and to discover significant rela- 
tions between the objects of his experimentation. The student learns to use 
the device of counterexamples to destroy the untenable conjectures, and as 
his experience grows he learns the meaning of providing the security of a 
proof for the surviving conjectures. As is natural in all fields of human 
activity the word labels follow recognition of phenomena, and the incentive 
for the precise and concise use of language comes from the desire and the 
need to share one’s experience with others.” 

In the spirit of discovery so natural to all number theorists, and on the 
basis of the mathematical skills developed in the initial number theory 
course many other topics were taught successfully to those youngsters, e.g., 
mathematical logic (I. Thomas), geometry (J. Yaqub and H. Zassenhaus), 
linear algebra (Ledermann), algorithmic mathematics (H. Brown, H. Zas- 
senhaus), combinatorics (D. Ray-Chaudhuri, N. Robertson, T. Dowling, and 
R. Wilson), analytic number theory (P. Turan), geometry of numbers (K. 
Mahler, A. Woods, P. Bambah, F. H. Davenport), advanced number theory 
(A. E. Ross), discrete groups (H. Zassenhaus), discrete geometry (A. 
Heppes). As in all other projects he accomplished, Arnold Ross used the 
principle “to match the talent of the teacher with the talent of the students.” 


2. Academic Year Institute. A strong program of training master 
teachers implemented at the University of Notre Dame 1960-1970 with NSF 
support. 

Distinguishing features: Classes of not more than 20 selected mathema- 
tics teachers trained intensively during two summer periods of about six 
weeks each and an academic year between. 

The choice of curriculum is guided by F. Klein’s ideas on “elementary 
mathematics from the higher view point.” Training offered in number theory 
(introductory), hard analysis (real and complex), linear algebra (geometric 
algebra), and geometry (affine and projective). 

In my 1963 article} I reported about my experience of teaching geometric. 
algebra while Y. Rainich was teaching the geometry. It is the renewal of 
commitment to basic mathematical ideas which distinguishes this program 
from an unstructured university program as well as from a program of 
studies in a college of education without the input of mathematicians. It is 
the three-phase nature of this particular program which gives it its depth. Of 
course the majority of the student participants continued their teaching 
career, as was the intention of the program. But many of them were placed 
into positions of increased administrative responsibility. About a third of the 
trainees went on to graduate Ph.D. at a number of universities and are now 


+ H. Zassenhaus, “The concept of depth in teacher training,” Amer. Math. Monthly 70 
(1963), 85-88. 
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in academic life. One graduate (Burt Kaufman), after an unusual academic 
and teaching career, founded a Federal laboratory for mathematical curricu- 
lum research and development which is now operating at St. Louis, 
Missouri. 


3. Proposal to upgrade the performance of graduate student assist- 
ants (5. A. E. Ross, 1965) This became a living way of introducing the un- 
tutored incoming graduate student of mathematics at the Ohio State 
University into their subsequent course of more rigorous studies during the 
summers 1970-1974 by a specially designed summer program. It was always 
interesting to watch how the enthusiasm of the gifted high school students 
slowly infected the incoming graduate students, sometimes even the few 
older graduate students who were able to keep the pace of an ambitious 
eight week program. 


4. Basic algebra graduate course. A new course originally taught by A. 
E. Ross (1967/68, 1968/69) to inexperienced incoming graduate students 
during the academic session. Based on lots of number theoretic examples the 
basic concepts of contemporary algebra are developed. The class is 
regrouped several times, and the top section is transformed in the course of 
the third quarter or in the subsequent summer quarter to a group of budding 
professional mathematicians. 

For the three programs: 


5. Horizons Unlimited, 

6. New Careers, and 

7. Saturday Morning Program cp. A. E. Ross, 1969, 1970. The Sat- 
urday morning classes started out as a Sunday class at Notre Dame Univer- 
sity for local children who liked to be exposed to challenges of their 
mathematical inventiveness. For practical reasons the time of the weekly 
gatherings was changed to Saturday mornings. I have participated in this 
enterprise for several years; I was amazed at the distances boys and girls 
would travel (as far as from Gary to Notre Dame, or farther) to pick up 
additional information on regular polyhedra on rigid movements of the 
plane. Once, we spent nearly an hour on just discussing and illustrating the 
nature of the identity mapping interpreted as a rigid “motion.” 


Saturday morning classes are now being conducted by A. E. Ross and his 
staff in cooperation with the Columbus school system. The value of this kind 
of university input into the local community cannot be measured in terms of 
dollars and cents. Our environment becomes a better place to live in if it 
provides opportunities like the Saturday morning classes to all inquisitive 
persons. 

In a long career leading from his 1931 graduation at the University of 
Chicago to the 1976 Summer Institute at the same University Professor 
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Arnold E. Ross developed and exercised his own talents as a teacher and 
“conductor” of the mathematical community. 

His personal influence and the excellence of his building is much in 
evidence at the University of Notre Dame and at The Ohio State University 
and will remain so after his retirement. Through his Summer Institute, many 
gifted students of the State of Ohio as well as of every other State of the union 
and also of the Canadian provinces of Ontario and Quebec have profited 
immensely. What is more important, the ideas of Arnold E. Ross continue to 
develop their infectious quality in far away countries like Australia, India, 
and Germany, and his example and personal stimulation remain alive with 
colleagues like A. Woods, J. Yaqub, D. Ray-Chaudhuri, J. Riner, T. Ralley, 
and many others, including myself. 


HANS ZASSENHAUS 
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At the age of about 15 (a pupil of the only Mittelschule for girls in Linz, 
Upper Austria) I came to realize that science and mathematics were to be 
my subjects. Slowly this changed to “mainly mathematics,” with science still 
of great interest to this day, and in due course mathematics meant mainly 
number theory. 

I became a student of the University of Vienna where I received my 
doctoral degree, with a dissertation entitled “Uber eine Verscharfung des 
Hauptidealsatzes.” My supervisor was P. Furtwangler who had established 
the main facts about the Hilbert class field. Hilbert had himself stated these 
facts, based only on his research of relative quadratic extension fields. Only 
very few of Hilbert’s statements were proved to be incorrect, among them 
was the statement that the class field had class number equal to 1. How- 
ever, the weaker statement, the Hauptidealsatz, was proved correct during 
my student days by Furtwdngler too, using the newly found technique of 
translating the assertion into a statement about finite groups with abelian 
commutator subgroups. With feasible problems in class field theory being 
very scarce Furtwangler was delighted to have plenty of thesis problems 
available now by using the same technique. His idea was to study the 
fields between the given field and the class field to see which and how 
many ideal classes of the ground field would become principal in such a field, 
1.6., a generalization of Hilbert’s Theorem 94. Unfortunately, this did not 
turn out to be a good idea. The technique did not work well for the inter- 
mediate fields and the theorems seemed to be of a chaotic nature. The results 
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of my thesis problem made Furtwangler give up class field theory forever 
after. Artin called the problems hopeless. However, I myself returned to 
them in quite recent years, based on a group theoretic proof of Theorem 94 
by M. Hall. I gave this new research the title “Hilbert’s Theorem 94,” and 
several, mainly young people, have contributed toward it. At the time of my 
recent Ph.D., however, I made only two further contributions which had a 
certain influence many years later. One was my collaboration, mainly by 
correspondence, with A. Scholz on our investigation of Theorem 94 for 
relative cubic unramified extension fields of a number of imaginary quadra- 
tic fields. In this case exactly one class becomes principal (a fact that Scholz 
denoted as “capitulation”). Further, we examined the group theoretic pos- 
sibilities for the group of the second class field for fields with 3-class group 
with two basis elements, assuming that only one ideal class becomes princi- 
pal. The second contribution concerned a group theoretic proof for the fact 
that for the 2-class group a group of type (2, 2), the class field has an odd 
class number. This fact, the result of the joint work with Scholz, and the fact 
that a cyclic p-class group always leads to a class field with class number 
prime to p, led to a conjecture concerning group towers whose positive 
answer would have implied that every class field tower is finite. However, the 
group tower conjecture was defeated on group theoretic grounds and finally 
Safarevic (together with Golod) showed the existence of an infinite group 
tower for a numerical case. 

Among other subjects of particular interest during my student days was 
the work of Menger on metric geometry and mathematical logic. I obtained 
some results on the former subject and participated in the Wiener Kreis led 
by Schlick. 

During the following years I obtained a position at the Mathematics 
Institute of Gottingen University as one of the editors of Hilbert’s work in 
number theory and then returned to Wien as assistant to Hahn and Menger. 
I supervised a thesis on multiply monotone sequences, learning functional 
analysis in this way. At that time two important changes happened to me. 
On the mathematical side I became interested in topological algebra and 
“sums of squares,” on the career side I obtained a scholarship at Bryn Mawr 
College and a fellowship at Girton College, Cambridge, England. Topologi- 
cal algebra came to me via Pontrjagin’s work on topological fields and some 
fascinating problems posed by van der Waerden in the Jahresber. deutsch. 
Math. Vereini. In a joint paper Jacobson and I generalized Pontrjagin’s 
work to locally compact rings, and I published the first topological proof 
(via spheres) of Frobenius’ theorem concerning hypercomplex systems over 
the reals which are also division algebras. Later I proved that the Laplace 
differential equations in n dimensions for n functions can be deduced 
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from generalized Cauchy-Riemann equations only for n= 1, 2, 4, 8. This 
was reproved by Stiefel and plays a certain role in combinatories now. 

Jobs were very scarce in Great Britain then, even for British-born people 
and even more for the many foreigners taking refuge there. I obtained a 
junior teaching position in the University of London. It was then that I 
met John Todd (Jack) employed in a similar position at King’s College, 
University of London. We got married in 1938. 

In due course the war broke out and life became very difficult, e.g., we 
had to move 18 times during the war. The first year of the war was spent at 
Belfast where I supervised a young theologian, E. Best, who was also very 
much interested in mathematics. Jointly we started on the investigation of 
finite groups for which the concept of “normal subgroup” is transitive. This 
came about through a suggestion of John Todd. This work was continued in 
Italy (Zacher, Zappa) and in Germany (Gaschiitz, Huppert). During my 
time in Belfast I was cut off from my own mathematical papers, but the 
library of Queen’s University there was very helpful to my needs. I saw there 
among other things the papers of McCoy on matrices with property P and 
the papers of Latimer and MacDuffee on classes of matrices. Both these 
rather different basic sets of ideas have stayed with me ever since and have 
led to substantial research on my part. I will come back to this soon. 

My London college moved to Oxford during the war, a city expected to 
be safe from air raids. However, life there was very complicated because of 
the shortage of housing for just the reason of safety. In some ways I preferred 
living in London in spite of the bomb danger. My teaching duties became 
heavier, but not more interesting. My husband had a scientific research job 
with the British Admiralty in Portsmouth and later in London. I myself 
obtained a research job with the Ministry of Aircraft Production in 1943. It 
was there that I became fully interested in matrix theory for I worked in the 
so-called Flutter Group, under the matrix expert R. A: Frazer. It was impos- 
sible not to come under the spell of matrix theory in that group. My interest 
in bounds for characteristic roots of matrices, in particular in GerSgorin’s 
theorem and in criteria for stable matrices, arose out of this phase of my life. 
To this I will return. However, I myself was asked to work on boundary 
value problems for hyperbolic differential equations arising from flutter at 
supersonic speed on which Temple had already made progress. There were 
new problems, and my particular problem was mentioned in Hadamard’s 
book, “Théorie des Ondes,” in a footnote, as not arising in practical work. 
After a long struggle I managed to solve it and I published it after the war in 
a book dedicated to Courant on his 60th birthday. 

In 1947 my husband accepted an invitatfton to the USA to the National 
Bureau of Standards to work toward the exploitation of high speed elec- 
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tronic computing machines which were under construction then. I accom- 
panied him and was soon employed myself there. Our chief, John H. Curtiss, 
had much understanding for my professional possibilities. 

During this year we were allowed to spend one term at the Institute for 
Advanced Study where we were appointed as members. We were attached to 
the von Neumann group. It was there that I started to shake off my wartime 
weariness and also my wartime research interests. The latter, however, were 
still helpful in my work at the National Bureau of Standards where we were 
called back a year later and where we stayed altogether for 10 years. 
Although I was given much freedom there, on the whole, I was supposed to 
devote myself to problems that could be applied to practical work. I ac- 
cepted part-time payment, partly to be able to return to number theory, 
partly to avoid the fixed civil service hours. During my time at Princeton I 
had contacts with Artin, Chowla, Reiner, Schafer, among others. In particu- 
lar a conversation with Chowla brought me back to the theorems of Latimer 
and MacDuffee on classes of matrices. At the end of my stay I gave a lecture 
at Harvard on my boundary value problem and then proceeded to Los 
Angeles, to work with Jack at the newly opened Institute for Numerical 
Analysis (INA), a field station of the National Bureau of Standards at 
UCLA. After a very cold winter in Princeton the California climate helped 
to restore me even more and my new research interests became even clearer 
to me: eigenvalues of polynomials in several matrices on the one hand, 
integral matrices on the other. For the first a suggestion by Mark Kac was 
most helpful. It was the definition of the L-property (L for linear) for pairs of 
matrices. This property means that in the pencil generated by the pair, the 
eigenvalues are linear functions. When the very gifted mathematician 
T. S. Motzkin joined INA, I told him of my partial results. We soon 
started on joint publications and proved some general facts on matrices with 
multiple eigenvalues in a pencil and a theorem (suggested to us by Kaplansky) 
concerning a pencil in which every matrix is diagonable. Motzkin in- 
troduced methods of algebraic geometry into this subject. I myself realized 
its close connection to complex function theory; Kato reproved our result 
via perturbation methods and included it in his big book on this subject. 
Later on I published further work on the L-property by myself and sug- 
gested problems to others. 

I have at all times suggested problems to others, and I had even published 
several in the section on research problems in the Bulletin. Five of those 
led to published research. One of them concerned the infinite Hilbert matrix. 
This matrix does not have z as an eigenvalue if only vectors in the 
usual Hilbert space are admitted. However, I suggested to prove that x was 
still an eigenvalue for vectors outside this space. This was shown by Kato 
and further elaborated by M. Rosenblum. 
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While still at the Bureau of Standards, I continued research on the 
theorem of Latimer and MacDuffee and started work on units in the integral 
group ring. I became interested in this via Higman’s thesis. Later I treated 
them via unimodular group matrices. I introduced M. Newman into this 
subject, and we published several papers on it; I was particularly concerned 
with the positive definite case. 

In 1957 we both accepted positions at California Institute of Technology 
and it seemed to me as if an odyssey of 20 years (I left Cambridge, England 
in 1937) had ended. I could at last work again with academic freedom and 
have Ph.D. students. I had eleven there. At the National Bureau of 
Standards I had one only, K. Goldberg, attached to the close-by American 
University. He worked on the Hausdorff formula. Otherwise I worked with 
several postgraduates like A. J. Hoffman and with visitors: Ostrowski, 
Wielandt, K. Fan, H. Cohn, J. C. P. Miller, P. Stein, Stiefel. 

When still at the National Bureau of Standards I was able to use a 
theorem by Shoda which I had cherished for a long time. It concerned 
matrices with trace =0 or det = 1. Now at Caltech my student R. C. 
Thompson generalized this theorem to all possible fields. Another student, 
C. Hobby, with the help of Zassenhaus, contributed greatly to the solution of 
the class field tower problem. His work was completed by Serre. 

Several scholars of immense power came to Caltech during my stay and 
worked with me a great deal. After years of frequent isolation from math- 
ematicians working in my line, or sometimes from mathematicians 
altogether, sometimes in positions with hard duties, this was most beneficial 
for me. These were particularly in number theory Zassenhaus, Dade, and A. 
Froehlich. They were really interested in my problems on integral matrices 
and raised my understanding of my own ideas. Unfortunately, in previous 
times, even with brilliant mathematicians as colleagues I could not force 
myself to break away from my own problems, even if my progress was 
frustrating at times. Even in my student days I was alone in my number 
theoretic work. In Europe in the old days one did not call on one’s supervi- 
sor freely. However, this attitude helped me to develop my own problems. In 
recent years a young man Kisilevsky has been a brilliant colleague. In any 
case working with the gifted Caltech students is a great stimulus. 

Again, at Caltech, like at the Bureau of Standards, the chairman of the 
department, Frederic H. Bohnenblust, set up my job to give me much 
freedom. 

I want to return to my work on matrices at Caltech. I continued on the 
L-property, but still stimulated by my experience in aerodynamics I made 
progress on the algebraic treatment of Lyapunov’s stability criterion. This 
led to considerable work by others. Here I obtained much stimulus from 
W. Givens and from H. Schneider. My contacts with the powerful matrix 
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school of Marvin Marcus are very beneficial at all times. Work with Wielandt 
who visited Caltech was also very beneficial. 

Apart from work on the L-property I was active in other parts of matrix 
theory, other generalizations of commutativity, and in particular commuta- 
tors. The thesis of Gaines fits in here; that of Parker is in another area of 
matrix theory. 

At times work on matrices leads to analogous investigations in number 
theory and for real matrices. This was evident in my own work in two 
different areas: (1) Positive definite real symmetric or hermitian matrices. 
Here, e.g., my work connected with the Lyapunov theorem and two theses 
(C. Johnson, R. Loewy) fits in. On the other hand, in number theory 1 
worked on positive definite group matrices (i.e., integral “symmetric,” “posi- 
tive definite” group ring elements) and on integral symmetric matrices (this 
is linked to positive definite similarity). For the rational case there is the E. 
Bender thesis, recently greatly extended by him. (2) Factorization into sym- 
metric factors (thesis of Uhlig) for real matrices and into integral symmetric 
matrices for integral matrices (so far mainly for n = 2) by myself. This last 
work is linked with norms from quadratic fields, as well as in my quite recent 
work on rational 2 x 2 commutators AB — BA. 

Among my matrix work there is a commutator result on which I made 
some observations in a paper of 1961 (with elegant follow-up by Zassenhaus) 
which seems to have stimulated analysts like De Prima and his students, 
Putnam, and Berberian, although its origin is from classical finite group 
theory. It concerns unitary matrices whose eigenvalues lie on less than a 
semicircle, called cramped matrices now. 

While I myself found no occasion to return to the GerSgorin theorem, 
my early light contributions and problems were taken up by several 
colleagues, in particular A. J. Hoffman, P. Henrici, J. Todd, and R. S. Varga, 
who arrived at important results. 

Of my other number theory students—L. Foster, D. Davis, and D. Maurer 
—the first two based their investigations on tables obtained by computers. 
I myself have been much interested in computational number theory, a 
subject in which my co-author A. Scholz—mentioned earlier—was a master. 
For my work on integral 2 x 2 matrices the tables of E. L. Ince on class 
groups and units in real quadratic fields are a wonderful help. 

Among other results obtained during my Caltech time is a “sum of 
squares” result, concerning a new 8-square identity. This was generalized by 
Eichhorn and Zassenhaus to a 16-square identity. 
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We consider a lattice L in an octave algebra V, and an even integral quadratic form 4, defined on L. 
It will be shown that δ᾽. ει, qay=n (a7*, 6) is O(n** 3) if k = 1, 2, 3, and is O(n**>/?**) for any ¢ > 0 
if k 2 4. 


Introduction 


It is known that the number of variables of an even integral quadratic 
form with determinant unity must be divisible by 8. It is also known that 
there is, up to equivalence, exactly one such form of eight variables. We 
consider an octave algebra V over the field of real numbers, the underlying 
space of which 15 an eight-dimensional real vector space, equipped with a 
(multiplicative) norm 4: V > R. A scalar product on V is defined by (x, y) = 
q(x + y) — q(x) — q(y). It will turn out that we can choose a lattice Lc V 
such that q defines on L an even integral quadratic form with determinant 
unity. Let F be a spherical function with respect to g. By a theorem due to 
Schoeneberg (see [3]) we know that the theta series 

Q(t; F)= Σ᾽ F(aje?™" 
aeL 
(where t is a complex variable whose imaginary part is positive) is a modular 
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form of weight (4 + deg(F)) for the full modular group. If deg(F) > 0, it is 
even a cusp form. Define for any positive integer k 


A,n)= > (ae), n2O. 
aeL, q(a)=n 
If k is odd we have A,(n) = 0. We will consider A,,(n). It will be shown that 
by choosing special spherical functions F we obtain a relation between 
A,,(n) and the coefficients of certain cusp forms. This will enable us to 
provet 


Theorem For any <> 0 
A>,(n) = p,n*o3(n) + O(n**??*°), 
where p, = —360, p, = 144, p; = -- 24, and p, = 0 for all k 2 4. 


In fact, this theorem states that for k 2 4 the geometrical distribution of the 
2kth powers of octaves in L of norm n is utterly dissimilar to this distribution 
for k <3. 

The contents are as follows. First we recall some facts about spherical 
functions, most of which may be found in Hecke [2, pp. 849-853]. Next we 
describe how x?* can be expressed in terms of x and (the unit element) e, and 
finally the proof will be carried out. 


1. Let V be a finite-dimensional real vector space of even dimension 2n, 
and let (,): V x V— R bea scalar product. We write q(x) = 3(x, x) for any 
x € V. A homogeneous polynomial function F is said to be a spherical 
function with respect to q if 


22Ε 0*F 
aot τῳ 513 = 0, where x =X,@; ἘΠ + X2n@2n 
1 2n 


for some orthonormal basis e,,..., @2, of V. Hecke [2, p. 853] has shown that 
we can obtain all spherical functions of degree k by taking linear combina- 
tions of the polynomials 


(x, y) 
(a(x da)" 5:5 
q(x)a(y) 
for all parameters y ε V. Here H, is a polynomial in one variable T satisfy- 
ing the differential equation 


dH 
wk (Qn — 1)T Ot + (Qn + k — 2)H, = 0. (1) 


2 ἀξ 
(1 -- ΤΊ) Fp dT 


+ This question was raised by F. van der Blij. 
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From 
1 


[ A(T)A(T\1- ΤΉ) ΣΤ πὸ if Καὶ (2) 
-1 
it is easy to deduce that H,(T) is up to a scalar multiple equal to 
I d* 2\kK+n~-3/2 
{-- Py3? a [(1 — ΤΊ) ] (3) 
We normalize H, such that the coefficient of T* is equal to 1. In particular we 
get 
H,(T) = 1, H,(T)= I, H,(T) = ms 
τϑ.. 6n + 3 
(2n + 1)(2n + 2) 


We shall need only spherical functions of even degree 2k. We have, for k 2 1, 
2κ-ἰ 


H,,(T) = T* — w., + X Yj, 2H (T), (4) 
jz 


H,(T) = T. 


where y; >, = 0 if j is odd. Using (2) we can determine the constant w,. We 
have 


1 1 
0 = | H,,(T)Ho(T)(1 — ΤῊ)". 3/2 dT = | (T?* — w,,)(1 — ΤΊ" 32 aT, 
=] 


= 1 
which implies 
= Γ(Κκ + 4) (n) (5) 
Γ(κ + n)I'(5) 
2. Let V = R®, considered as an algebra of octaves. As a general refer- 
ence for octaves we use Coxeter [1]. Let e;, 1 <i <8, be the standard basis 
for V, and let the multiplication in V be fixed such that e, = e is the unit 


element of V. We define for x = (x,, ..., xg) € V, g(x) =x} +-+°° + χξ. 
The following formula holds: 


M5 


x? = (x, e)x — q(x)e. (6) 
Furthermore gq is multiplicative, i.e., 
q(xy) = q(x)q(y). (7) 


From (6) we can easily deduce that x7* = P,,x + Q>,e, where P,, and Q,, 
are polynomials in the two variables X = (x, e) and Y = q(x). We have 
P4(X, Y) = Xx, Q,(X, Y) — ἐπ ¥. 
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The following recursion formulas hold 
Py, = XP 4-2 — YPx,-2+ XQx%-2, QO, = —XYPx,-2— YQOu-2- 


(8) 
We conclude from (8) that P, and Q,, must be of the form 
P,,(X, Y)= on ὌΧ ΤΥ. 
ae (9) 
Q>(X, Y) = Y 2 Bar 2X RM?" 
By induction it can easily be proved that 
O21 x = (— ἫΝ Ε 7 ᾿ 
and 
Bos = τη 7. 4} 0<I1<k-1. (10) 
For convenience we put αι, », =0= Bz, χκ {{| «0 or! 2k. We get 
(x, ὁ) = Sofa, οι + 2Bar-a, ales e/a (11) 


3. Let R c V be the subset 


Σ v(i) even}. 


t= 


pie 
R= (te tel <i<j<iuf; δ᾽ (—1)e; 
i=1 


Ris ἃ root system of type Eg. Let L be the lattice generated in V by $,/2 R.A 
straightforward calculation shows that g(L) «- Z, and q defines on La qua- 
dratic form with determinant unity. It may be found in Hecke [2, p. 868] that 
for any positive integer n 


#{a ε L|q(a) = n} = 240σ 3(η), (12) 


where o3(n) = diay, 4“. 
Now we take F,(x) = q(x)*H 2,((x, e)/2,/q(x)), a spherical function of 


degree 2k. We know that ©(t; Ε 2.) is a cusp form of weight 4 + k. We define 


Ba)= Σ, Fald=nt 5 Ha?) 


Since there are no nonzero cusp forms of nee 6, 8, 10, we have 
B,(n) = B,(n) = B,(n) = 0. (13) 
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By (4) and (12) we get 


1 k-1 τ 
Βφκ(μ) = 521 > (a, e)** + » Y21, on" 'B.,(n) — 240n‘w.,.0 3(n), 
aeL i=1 | 
which implies 


k 
2 7 (a, 6) ἜΣΟΣΝ ».n*'B,(n) + 240n*27*w,,03(n) 
aeéeL, q{a)=n t= 


for some constants 65, »,, 1 <1 <k. Substituting this in (11) yields 


k-1 k-l 


A,(n) = Σ Σ [o> ax + 2B2)- 2, Δι]δ,, Δ2κ-- προ 1Β,{π} 
1=0 j=1 


k 
+ 240n“o3(n) Σ᾽ [ἀλι, 2% + 2B 21-2, an ]27*~ 024-94 - 
1=0 


Suppose now k 2 4 and consider the last sum. By (5) and (10) this sum 
equals 


k Vy τς = [S1)! 
ΡΣ εν (k— 143)! (k—D! 
12(k — 4)! « 


eam BOOM a) 8 


as can easily be proved by writing Τὸ ‘(1 + T)* = δ ({)T* τα differ- 
entiating both sides k — 4 times, and substituting T = i" So Aa,(n )is a 
combination of n‘~/B,,(n), provided k 24, and since the B,,(n) are 
coefficients of cusp forms of weight j + 4, this establishes the part of the 
theorem for k 2 4. 

As for k = 1, 2, 3, it follows immediately from (13) that 


A,(n) = —360no;3(n), 
A,(n) = 144n*o,3(n), 
A,(n) = —24n°a;(n), 


which establishes the remainder of the theorem. 
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On the Product of Three 
Inhomogeneous Linear Forms 


R. P. BAMBAH A. C. WOODS*+ 
PANJAB UNIVERSITY THE OHIO STATE UNIVERSITY 
CHANDIGARH, INDIA COLUMBUS, OHIO 


For i = 1, 2, 3 let L; = αι μι + a;.u. + a;3 U3 be linear forms in the variables u,, u,, u3 with real 
coefficients αἰ; satisfying det(a, ,) = 1. It is proved that given any real numbers c,, C2, C3 there exist 
integral values of the variables u,, u,, u3 such that simultaneously 


3 
[[|2,+¢| <4 and Li, +c, <0. 
i=1 


1. Introduction 


For 1 Si Sn, let L; = a;,;u, +++: + a,,u, be n linear forms in the var- 
lables u,, ..., u, with real coefficients α;; satisfying det(a,;) = 1. A classical 
conjecture of Minkowski asserts that, given any real numbers c,, ..., c,, the 
inequality 


I] [1 + ¢;| τ 5 
i=1 


+ The research of the second author was partially supported by National Science Founda- 
tion Grant GP43919. 
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has a solution in integral values of the variables. This has been proved only 
for n <5; see Skubenko [4]. Stemming from this problem is the conjecture 
that, given any real numbers c,, ..., c,,, the inequalities 
[]|Z4+¢|<2°°', L,+¢,<0 
i=1 
have a common solution in integral values of the variables. This has been 
proved by Cole [2] in the case n = 2. Our aim is to present a proof in the case 
n = 3, using a method introduced by Birch and Swinnerton-Dyer [1]. 

Let A; = (a1;, ..., Gj) for 1 <j <n and denote by A the lattice of basis 
A,, ..., A, so that d(A) = 1. Set m(A) = inf |x, --- x,| extended over all 
points (x,, ..., x, of A other than the origin O. Paralleling the work of Birch 
and Swinnerton-Dyer [1], it is sufficient to consider only those lattices whose 
homogeneous minimum m(A) is positive and attained by some point of A 
provided it is shown that the strict inequality holds in this case. An alterna- 
tive proof of this reduction will be given elsewhere using the divided cell 
technique. In what follows it is assumed that A has these properties. 


2. The Main Lemmas 
Let a be a positive real number with αὐ <4. Denote by K, the set of 
points (x, y) that satisfy an inequality of the form 
Ix|¥o + |¥|x0 < 4/a 


where Xo, Yo are positive numbers with 


[14 [31 4 
Xo + Yo 5 414 ian and Xoo = =. 


Similarly, denote by K, the set of points (x, y) that satisfy 
Ix|¥o + |y|xXo < 4/a 


where Xo, Yo are positive numbers with 


3 1 4 
Xo + Vo <4 [25 and MOOS oe 


Put 6 = 6(,/2 — 1)/7. 


Lemma 1 Let L be a lattice in the plane with d(L) = 4/a and such that no 
two points of L are less than a distance Ji a apart. Then there exist positive 
numbers €, ἡ such that the parallelogram Π given by |x|n + |y|E <4/a= ξη 
is L-admissible and lies in K, if a? 290, in K, if a’ <8. 


Let S denote the set of points (x, y) such that there exists a half-open 


ON THE PRODUCT OF THREE INHOMOGENEOUS LINEAR FORMS 9 


interval I of length a for which t € I implies 
(1) |[(x+t)(y+c(t-—a)| <1 and (2) t-a<0O. 


Similarly, let S’ denote the set of points (x, y) such that there exists a half- 
open interval I of length a for which t € I — {0} implies (1) and (2). 


Lemma 2 (a) [{α 20, then K, Ξ 5 and B(K,)SS’. 
(b) Ifa’ <0, then K,¢S and Β(Κ,) « δ΄. 


3. Proof of the Theorem 


Before proving these lemmas we show how the theorem follows. We first 
restate the theorem in a form convenient for our purpose. Let A be a lattice 
in κ΄ with d(A) = 4. Again assume that the homogeneous minimum m(A) is 
attained and nonzero. 


Theorem Given any real numbers xo, yo, Zo, there exists a point (x, y, 2) 
of A such that 


[(χ + Xo)(y + yol(zZ+20)| <1 and z2+2,<0. 


Proof Let X ε A be such that m(A) is attained at + X. Applying a map 
of the form x -- Ax, y—> py, z—> |Au|~ *z, where λ # 0, μ # 0, it is clear that 
we may assume either X or —X is the point (a, a, a) where a> 0. By a 
well-known result of Davenport [3] it follows that (i) αὖ <4 and (ii) the 
projection L of A into the plane z = 0 parallel to the vector (a, a, a) is a 
lattice of determinant 4/a with no two of its points less than . /4¢ a apart. By 
Lemma 1 there exist positive numbers €, ἡ such that the parallelogram IT 
given by |x|n + |y|& < 4/a = én is L-admissible. By a classical result of 
Minkowski, any coplanar translates of II and Leither have a common point 
or their boundaries have at least two common points. The projection of 
A + (Xo, Yo» Zo) onto the plane z = —a parallel to the vector (a, a, a) yields 
a translate of 1, Hence, either there exists a point (x*, y*, —a) of this trans- 
late such that (x*, y*) € II, or there exist two distinct points (xt, yf, —a), 
(x}, v3, —a) of this translate such that (x%, yt) and (x¥, y$) are on the 
boundary of IT. In the first case Lemma 2 implies that (x*, y*) € S and, since 
(a, a,a)e A, the theorem is true. In the second case (x*¥, y¥) eS’ and 
(x$, y$) ε δ΄. By similar reasoning to that used in the first case, either the 
theorem is true or (xf, y¥,-—a) and (x%¥, y$, —a) both lie in 
A + (X09, Yo, Zo), which would immediately imply that m(A) = 0, the case we 
have excluded. This completes the proof of the theorem. 
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4. Proof of Lemma 1 


Let c be the largest positive number such that the parallelogram P given 
by |x| + |y| <c is L-admissible. 

Casel c2 2/./a Let II be the parallelogram obtained by putting 
€=n= 2/./a so that ITI is L-admissible and €n = 4/a. Then 


4 14 3 
pee Fae ee 
ξ- η Ts 3 a+ 42 
since (a?/? — 2. /34)? > 0. Thus TT lies in K,. If a? <9, then, since 0 < 3, 


4/./a «.4.,3α.2 
and II lies in K,, which completes this case. 
Case2 c< 2//a Denote by (x*, y*) one of the points of L that neces- 
sarily lie on the boundary of P. The symmetry of the lemma allows us to 
suppose that y* 2 x* 20. By hypothesis 


x¥? 4 y#? 5 3342. 
Let &, ἡ be positive numbers such that 
(i) ἔξη = 4/a, 
(ii) € 5 ἡ, and 
(iii) the parallelogram Π given by |x|4 + |y|& < 4/a contains (x*, y*) 
on its boundary. 


The existence of such a pair €, ἡ follows from the fact that P contains 
(x*, y*) on its boundary and c < 2/./a. We claim that this choice of €, 7 
satisfies the conclusion of Lemma 1. In the first place, II is L-admissible, for 
otherwise II contains a point of L linearly independent of (x*, y*) in its 
interior, which implies that d(L) < 4/a, a contradiction. Assume first that 
αὐ > θ and, by way of contradiction, that 


[14 a 
ἕ + n> 3 a+4 4° : 
Since ζη = 4/a, € 2 n, JH α4. ἕνα 2 = 4/a and 4. (ἦα 2 > J#a it fol- 
lows that ξ > 4. /3,a~? and  < 33 α. 
The line xn+ yé=4/a meets the line x=y in the point 
(4/a(E +), 4/a(E + 7)) whose distance from O is 
4/2 “ὦ ς ff ᾿ 
α(ξ -- ἢ al. /#a4+ 4/407?) 3 
since θ < a>. Hence the line xy + yé = 4/a meets the axis x = 0 and the line 
x = y in points both less than a distance ./*;* a from O. Since y* 2 x*, this 
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does not permit (x*, y*) to lie on the boundary of I, a contradiction. There- 
fore we may assume αὐ < @ and, by way of contradiction, that 
3 1 


ry elle 
¢+n> 7a’ 


3 3 1 3 3 1 
* _ ἐν oe) ΤῸ x oes eo nae 
: 2 /5+2/3 δ): : 2/3 7 oi 


so that, in particular, ἢ = 4/a, €* 2 n*, and €* + n* = 4, /3 α΄ *. The line 
xn* + y¢* = 4/a meets the line x = y in the point (νῷ a, /4a) which is at 
distance ./44.a from O. The same line meets x = 0 in the point (0, 4/af*) 
whose distance from O is 


Put 


eee as 
act* 3 
since αὐ <9. By the initial assumption, € > €* and ἡ < n* and so the line 
xn + y€ = 4/a meets x = 0 and x = y in points both less than a distance 
./4# a from O. Again this implies that (x*, y*) cannot lie on the boundary of 
II, the contradiction that proves Lemma 1. 


5. Proof of Lemma 2 


Let (x, y) € CL(K,) αὐ 26 € CL(K,) if αὐ <6. Let x9, yo bea pair of 
numbers that can be associated with x, y in the definition of K, and K, so 
that |x|yo + |y|xo <4/a=Xoyo and, therefore, by the arithmetic— 
geometric mean inequality 


|xya| <1 (1) 


with strict inequality unless (x, y) is not in either K, or K,. Thus, if 
|xya| = 1, the hypothesis of the lemma allows us to exclude the value t = 0. 
We shall make repeated use of this fact in what follows without expressly 
calling attention to it. From the symmetry in x, y we may assume |x| < 
|y|. Further, |y| <4/axo = yo, so that an upper bound for yo is also an 
upper bound for y. If (x, y) ε CL(K,), then 


yo <4. /Feam? = (1.8516...)a~?. 
On the other hand, if (x, y) ε CL(K,), then 


Yo < (2/3 + 2,/3 — @)a-? < 4, /3a7? = (2.6186...)a~?. 


Thus, in either case, yo < 4.,3 α΄ 2. 
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Case A x <0 Put f(t) =(x 4+ t)(y + t)(t — a). We shall show that if 
0<t<a, then | f(t)| < 1. We distinguish cases: 


(i) x <—a In/f(t) the coefficient of t is xy — ay — ax = c(t), say. If 
y <0, then c(t) > 0 since all terms are nonnegative. If, however, y > 0, then 
c(t) < Osince | y| = |x|. Hence f(t) is monotone on [0, a]. Since | f(0)| <1 
and f(a) = 0, the result follows. 

(ii) —a<x<0O and y>0O Again c(t) <0 so that f(t) is monotone 
decreasing on [0, —x]. Since | f(0)| <1 and f(—x) = 0 it follows that 
| f(t)| < 1 on (0, —x]. On the other hand, if ε [—x, a), then 


= |(t+ x(t + y(t —a)| <4a7({y| +a) « 1. 


(iii) —a<x <0 and y <0 Again c(t) 20 so f(t) is monotone in- 
creasing on [0, —x] and, as before, | f(t)| <1 when te (0, —x]. If 
t e [—x, a), we repeat the argument under (ii) to obtain | f(t)| < 1. 


CaseB x2a f(t) has a single extremum in [-- χ, a] and therefore is 
monotone on at least one of the two intervals [-- χ, 0] or [0, a], and so the 
result follows as before. 

We must now distinguish between the cases αὖ > θ and αὖ <0. Assume 
for the time being that αὖ 2 @ so that, in particular, 


Yo <4,/qga"? 
CaseC O0<x<Ja For0<t<a we have 
| (x + t)(y + t)(t — a)| = |yo(x + χοίν + t)(t — a)/X0 yo | 
<4(yo(|x| +t) + xo(|y| + ἢ) rere 
«(απ 4t(. Ata + 4. α΄ 2))2α(α — ἡ), 
and, putting t = λα, so that 0 « 4 « 1 this is 
(1+ ἐλ, fa? + 4/8) (1 — A) < (1 + sA)?(1 — A) = g(A), say, 
where s = 4 /MB + /% <.772. Then σ΄ (λ) = (1 + sd)(2s — 1 — 38d) «0 if 


1 <j) < 1. Since g(1) = 0 and g(4) < 1, it follows that when 3a <t « awedo 
have 


[(χ + t)(y + ἡ(ε -- α)} <1 


as desired. Now suppose 0 < t <4a, so that 0 <A <4. Put x = μα so that 
0 <p <4. Then 


| (x + t)(a — t)| = a?(A + μ)( — A) <4a7(1 + μ):, 
whereas 


Xo Vo Ξ ΙΧ 
oS - τὶ Y=, - ἐμαῦγᾷ. 
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Hence 
| (x + t)(y + t)(t — a)| <2(1 + μ)λ(αὖγο — ἀμαΐ γὸ + a°A) 
<4(1 + μ)ξ(αἦγο — guatys + 30°) 
<4(1 + μ)λ(αἦγο — au(a’yo)” + 4). 
But a*yo ge 7x and, as a function of z, z—ipz? has a maximum at 
z= 2/u> 4,./7qz. Therefore 


ΠΣ aie dcdiacditdsiabcabh se 


Computing, we find that h’(u) 2 0 if0 <p <4 and also h(5) < 1. This com- 
pletes Case C. 


CaseD 4ta<x <3 38 Put x = μα, so that $ <p Ξ ἢ Assume first that 
y <0. We shall show that the interval 0 < t < a will ἘΠῊΝ To begin 


(x + ἡ)(α -- ἢ) <}#(a+ x)’. 
Thus, if y 2 —a, then 
[(χ + t)(y + t)(t — a)| <ja(a+ x)? <a’ <1. 
On the other hand, if y < —a, then 
N(x τὴν + t)(t — a)| <4|y|(a + x) 
<4(1 + μ)᾽ (αὖγο — 4u(a7yo)*) 
< 4(1 + μ)λᾳ4. ἶ — ἐμ(4.., })2) = i(u), 
Computing as before, i’(u) is positive and so i() is monotone increasing in 
, 3] and i(2) < 1, which takes care of y < 0. Now assume that y 2 0. We 


show ey the sae (-- -- μ)ὰ <t < (ἢ — μ)α will suffice. Put t = Aa so 
that --ᾧ <A+y< 3 Suppase that 0 <1 + yw <2. Then with 


i) [(x + t)(y + t)(t — a)| = (A + μ)Ὰ — A) | (ay + αὖλ)]}, 


if a2y+a°A <0 it is clear that f(t)<1, so we may assume that 
a*y + αὖλ >0 and so 


f(t) <(A+ wl — ANAS — Su + αδᾷ - μ))} 


< (A+ μ)( — AMA / + 9 — Py) = J(u), say. 


Computing as before, we find that j’(u) is positive in the range in question 
and so the maximum value of 7(μ) occurs when 4 + yu = 3. Hence 


f(t) <2(1 -- ὦ — Fe + PA) = kA), say. 
Further 


k(A) = 3(1— λγα + BA), where «a=4/3%-% and β- 39, 
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has a maximum value of 
3(α + B)?/16B < 1, 


and so f(t) < 1 when 0 <1 + p <3. Now suppose that —4 <A +p <0. 
Then f(t) is not larger than one of the two expressions 


(a+ μ)α -- λλῖν! and | (A+ (1 — Aja? 

both of which are less than 1 since αὖγ <4,/7 — 4u(4./s4)? and [1] < 1. 
This clears Case D. 

CaseE *<y<1 Assume first that y > 0. Since f(t) has zeros at a 

and —x and has no zero between, it follows that either the intervalO0 <t<a 


will suffice or f(t) < 1 on the interval —x <t <0. But if —a <t < —x, then 
f(t) is not larger than one of the two expressions 


|(x + t)(a — t)y| and \(x + t)(a — t)a| 


both of which are less than 1 since a*y < 2. 

Now suppose that y <0. As in Case D, if y 2 —a, then the interval 
0 < t <a will work. Hence we may assume that y < —a. Again either the 
interval 0 < t <a suffices or f(t) < 1 for —x <t <0. Thus suppose that 
—a<t<-x< ss Then |x + t| <4a, |a—t| <2aand so 


f(t) <4(a?|y| + a?) < (4 ὦ — Gut 9) «1. 
This completes Case E and we have now covered all the possibilities when 
αὐ > 9. Thus, assume from now on that αὐ <9. 


CaseC 0<x<4a Set y*=2,/3+2,/3—s and let 0 <y, [1] < 
be constants. 


Auxiliary Lemma [{ + 2p S /4, then the maximum value of g(s) = 
y* — dyy*? + sd is at s = 0 in the range [0, 51. 


Proof g'(s) = (3 — s)7*/2(A./3 — 5 — 1+ ἔμγ"). By hypothesis 
ee aa )60) "2 < (1 — μι βὴθ -- 5) "52, 

hence A(3 — s)'/? — 1+ 4uy* <0 and σ' (5) <0. This proves the lemma. 

Assume that 0 < « a. Then 

Ff (t) = |(vo(x + t)xoly + t)(a — Χο yo | 
(hyo |x +t] +4xoly+t|)|a—tla 

(α + t,/$a~?)?|a — tla. 
Put t = λα so that 0 < A < 1 and 


f(t) <(1 + 3 ap(1 — A) = nA), say. 


< 
< 
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Computing in the usual way, we find that n’(A) <0 ᾿ A 24. However, 
n(1) =0 and n(3) = (1 + 2171/7)? 2 «1. Hence f(t) «1 if 4 <A <1. So 
assume 0 «λ «3 3. Since with x=ypa, so 0 3 “prt it “follows that 


A+2U<¢< /4. Thus the auxiliary lemma pes and 
f(t) <(A + μ)( — Ala? |y| + λα) 
< (A + μ)( — A)(a?yo — apa*ys + Aa”). 


Since a> <6, so Yo ars 2/3 + 3 — qg> = γῆ, Recalling that z — jyz? is 
monotone increasing for z < 2/y and 2/u 24> y*, so 


f(t) <(A + μ») — ἢ" — auy*? + λα") 
< (A+ pw) -- A)(4./3 — 42) by the auxiliary lemma, 


<4(1 + y)?(4,/3 — 4p) 


by the arithmetic-geometric mean inequality, = p(u) say. The customary 
computation shows that p’(u) > 0 since μ <4 and so p(n) < p(5) < 1. This 
completes Case C. 


CaseD 4a <x <.694a Assume first that y 2 0. Again put x = μα and 
t= Aa so that 4+ <p <.694. We show that the interval for t given by 
— 306 --μ <A < .694 — p will suffice. We have 


f(t) = |e + ἣν + ἡ(α -- ἢ] = [A+ μ)1(1 -- Alla’y + a). 


Here we may assume that αὖγ + a°A > 0 for otherwise, since Lies 1 and 
|A + μ| <.694, it follows easily that f(t) < 1. Further, with y* -,. + 


2./4 — αὐ, the previous argument applies because 2/y > 2/.694 > 4,/3 3. 
Therefore 


f(t) < |(A + w)| (1 — γί" — auy*? + a°d). 
But 2+ 2p < .694 + .694 < J4 and the auxiliary lemma applies. Hence 
f(t) < |Q+ w|i -- λγ4.,  -- Fy). 
Suppose first that O <A + pu <.694, so that 
JA+ μ|{1 -- ἡ Ξ ( τ pl -- ἢ) 


which, as a function of A has a maximum at A = ξ([ — μ). However, A is 
always less than this value since 


2A + pw <.694 + (694 — 5) « 1. 
Hence we are justified in replacing 4 + μ by .694 and so 


f (t) <.694(.306 + u)(4,/3 — 12μ) = q(u), say. 
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The customary computation of the maximum of 4(μ) yields 
q(u) < .694(1 + .306./3)? = .999899... < 1. 
Suppose now that —.306 ΞΔ + yw <0. Then 

[A+ u|(l—A)= —A+ μ) -- A), 

and, since 1 < 0, this is a maximum when A = — .306 — yu. Therefore 

f(t) < .306(1.306 + 1)(4,/3 — +44) = r(u), say. 

γ(μ) has its maximum at μ = ἐς ἢ — .653 < 4. Since p > 4 it follows that 

r(u) < .306(1.806)(4,/3 -- $) « 1. 


Thus the lemma holds in this case if y 2 0. 

Now assume that y < 0. Again we may assume that y < —a for other- 
wise, taking 0 < t < a, we have f(t) <2a-a:a<4%< 1. Then f(t) has roots 
—x, a, and —y>a and so either 0<t<a will suffice or f(t) <1 on 
—x <t <0. Hence we may assume that f(t) < 1 on —p <A «0. In addi- 
tion, if « A <.694 — μ, the argument for y > 0 above goes through vir- 
tually unchanged since if a?y + αὖλ is positive then an upper bound is .194a° 
and f(t) <1, whereas if a*y + a°A is nonpositive, then its modulus is 
bounded above by a?|y| <y* — 4yy*? and the auxiliary lemma may be 
applied with A = 0 yielding the same bound as before. Hence we may assume 
that —.306 <1 +p <0. It is then evident that, as a function of A, f(t) is a 
maximum when A + p = —.306. Hence, incorporating previous arguments 


f(t) < .306(1.306 + μ)ν" — quy*? + a? (u + .306)) 


where y* = 2,/3 + 2,/3 -- a’. 
Put μ(α) = y* — 4py*? + a?(u + .306) and, for simplicity of notation, 
a= J3 and β = .306. Then u(a3), considered as a function of αὖ, attains its 


maximum at 
soa 1 — pa\? 
2u+ Bp 


and with this value of αὖ, after a little more computation, 


y* — ἔμγ"ξ + a3(u + 306) = (ua + af + 1)’/(2u + B). 


Hence 
f(t) < βᾷ + B+ μ)ίμα + a + 1)?/(2u + B) = υ(μ) 
say. Now v’(u) has the same sign as 
(μα + αβ +14 2e(1+ B+ μ)μ + B)—2(14+ B+ μ)ίμα + αβ + 1) 


which is a quadratic in with leading coefficient 4a > 0. Further, at μ = 0 its 
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value is «6B? — B ~2 <0 since «= /3 and β = .306. It follows that the 
maximum value of v(u) on 5 < μ < .694 is 


max(v(4), v(.694)) = max(.987..., 989...) <1. 


This completes Case D. 

Case E .694a <x <a _ Assume first that y>0. We show that the 
interval —a < t < Owill suffice. If0 < y <aand —a < t «0, it is trivial that 
f(t) < 1. Thus we assume that y > a. As before, f(t) has roots —x, a, and 
—~y<—aS-—x and so either f(t)<1 on 0<t<a or f(t)<1 on 
—x <t<0. Thus we need to show only that f(t)< 1 on —a<t < —x. 
Again put t = Aa and x = paso that —1 <4 < —yp, and 694 <p < 1. Now 


f(t) = |A + p|(a2y + a®A(1 — A). 
But αὖν + αὖλ <a’y « αΞγο — Ha?yo)? < 1/p, the maximum value in z of 
z — 4uz*, <1/.694. Hence 
f(t) < .306(2)(1/.694) < 1. 
Therefore we may assume from now on that y < 0. Exactly as before we may 
assume that y < --α so that either f(t)< 1 on 0<t<a or f(t) <1 or 


—x <t< 0. Hence it remains to show that f(t) < 1 on —a<t < —x, ie, 
on --ἰ «ἡ Κ --μ. In this case 


f(t) <2(L — p)(a*|y| + αὖ) <2(1 — p)(a?yo — Su(a? yo)? + αὖ). 
Suppose first that .694 «μ <.76. Since 
2/p > 2/.76 > 4,/3, 


so f(t) <2(1 — u)(y* — guy*? + αὖ) and clearly this is greatest when p is 
least, namely .694. However, the case 1 = — 1, μ = .694, y «0 has already 
been considered, and we have shown here that f(t) < 1. Hence it remains to 
consider the possibility that .76 <u < 1. Let —a < t <0. Arguing as before, 
it is sufficient to consider —a < t < —x, ie, —1 <A <—yp and then 


f(t) < [A+ pul] [1—A](a?|y| — αὐλ) 
<|4A+p||i- a\(i +a] < (24)2)( 75 + τ) «1, 
μ 16 7 
and this completes the proof of Lemma 2. 
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On the Degrees of the Sum and Product 
of Two Algebraic Elements 
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If « + B is not a generator of the composite field K(x, B), where K(a) and K(B) are linearly disjoint 
separable algebraic extensions of the field K of degrees m and n, then char K is a divisor of mn or it 
is smaller than 5 min(m, n). If «B does not generate K(a, B), then both fields Κα (α) and K(B) are, 
essentially, radical extensions of K. 


Let K be a field and let K(x) and K(y) be simple separable algebraic 
extensions of K of finite degrees m and ἡ, say, such that [K(x, y):K] = mn. 
The composite field K(x, y) is also a simple extension of K, and experience 
tells us that it is usually generated by x + y. This has been confirmed by 
Isaacs [1] for char K = 0, or char Καὶ = p}mn and p > min(m, n). Another 
proof of a weaker result in a more general context has recently been given in 
[2]. 

The purpose of this article is twofold. Firstly, we shall show that 
K(x, y) = K(x + y) already if p } mn and p > 4 min(m, n). Secondly, we shall 
study the analogous question for the products and give a complete descrip- 
tion of the extensions in which xy does not generate K(x, y). They turn out 
to be, essentially, radical extensions. 


+ Deceased. 
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Let us assume that [K(x, y):K(x + y)] =k > 1. The five fields K, (K(x), 
K(y), K(x + y), and K(x, y) will be said to form a deficient system (with 
deficiency k). Let us denote by x = x1, X2,---)%m and y = Vis Y2s-++> Yn the 
conjugates of x and y. Let us denote by K,, K,, and K,,, the normal 
closures of K(x), K(y), and K(x, y) over K. 

By our assumption, all the Galois groups Gal(K,(x;)/K(x;)) (1 <i < μὴ) 
are transitive on the y’s and the groups Gal(K,(y,)/K(y;)) (1 <j 3 πὴ are 
transitive in the x’s: Therefore, Gal(K,, ,/K) acts transitively on the sums 
x; + y; (1 <i<m, 1 <j <n). Since we assume that deg(x + y) = mn/k, we 
must have a system of mn/k equations of the type 


Xi, + Vj, = Xig FIV = ΞΧ Ἔκ’ (1) 


where the sums of x; + γ᾽ in different equations are distinct and the group 
Gal(K,, ,/K) acts on the equations transitively. 

Two equations of the type (1) will be called directly linked (with respect 
to x) if there is x; that appears in both of them. Two such equations will be 
called linked (with respect to x) if there is a chain of directly linked equations 
connecting them. Contrary to direct linkage, the concept of linkage is an 
equivalence relation and our equations will be partitioned in equivalence 
classes. An equivalence class containing x;, say, in one of the equations will 
contain all equations containing x;. Therefore, the class that contains x = x, 
will contain all equations in which occur some of the conjugates x,, X2,.--; 
Xm,> say. By the transitivity of Gal(K,. ,/K) on the equivalence classes, all 
classes will likewise contain m, conjugates of x, and thus m, |m. Every 
transformation from Gal(K,, ,/K) that carries over x; (1 <i < mj) into x; 
(1 <j <m,) fixes the whole equivalence class containing x, and thus it 
permutes the elements x1, X2,---5 Xm, and fixes the field 


my 

f * ° . eee 

K = ΚΙΣ Xi> Σ ΧΙ ΧΡ ΟΣ X 4X2 mi} 
ἱξ 1 1<i<jsm, 


Therefore, K’ < K(x) and the elements x,, X2, ..-, Xm, are conjugate with 
respect to Κ' and their degree is m,|m. Furthermore, we have K(x) = K’(x), 
K(x, y) = K'(x, y), and K(y) < Κ Ὁ) < K’(x, y). It follows that 


[K’(x, y):K'(y)] = [K'(x):K’] = m 
and 
[K'(x, y):K’(x)] = [K'(y):K’] = π. 
Moreover, [K’(x, y):K(x + y)] = k since the equations (1) are unchanged. 
In this way we have obtained the same situation over the ground field Κ΄ 


as we had over K, namely the deficiency k by which the degree of x + y has 
to be divided is unchanged. Now we shall perform the same kind of reduc- 
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tion over K’ with respect to y. If it happens that we destroy the linkage with 
respect to x, we shall perform the reduction with respect to x again, and if 
necessary, once more with respect to y, etc. After finitely many steps, we shall 
have a ground field Καὶ" < K(x, y) such that 


[K*(x, y):K*(x)] = [K*(y):K*] = n*|m, 
[K*(x, y):K*(y)] = [K*(x):K*] = m*|m, 
[K*(x, y):K*(x + y)]}=k> 1, and all equations (1) 
for the sums x; + y; (1 <i < m*, 1 <j < n*) will be linked both with respect 
to x and y. The five fields K*, K*(x), K*(y), K*(x + y), and K*(x, y) will be 
said to form a primitive deficient system (with parameters p, m*, n*, and k). In 
the following we shall restrict ourselves to primitive systems only. 
Remark If K, K(x), ΚΟ), K(x + y), and K(x, y) form a primitive 
system, there still can be a nontrivial subfield between K and K(x) or K(y). 
Theorem 1 Let K, K(x), ΚΟ), K(x + y), and K(x, y) form a primitive 
system with parameters p= char K, m=[K(x):K], n=[K(y):K], and 
k = [K(x, y):K(x + y)] > 1. If p } mn/k, then K, = K, = K,, ,, and if p 7 m, 
then K, & Ky, i.e., m|(n — 1)}. 
Theorem 2 If K, K(x), K(y), K(x + y), and K(x, y) form a primitive 
system of characteristic p, then p|[K,, y:K]. 


Proof of Theorem 1 Primitive systems have one important property, 
namely every x; (1 <i <m) can be written in the form 


X= xX, + VApy;, Σιλιξθ 8 (1 <i<m) (2) 
j=l 
and every y,; (1 <j <n) can be written as 


Yj=Vit D myx, Σ Hi = 0 (1 <j <n) (3) 
i=1 j 

where the /’s and p’s are rational integers. This is a consequence of the 
two-sided linkage. Note that the representations (2) and (3) may not be 
unique. Using (2) we see that 


tr(x + y) = (mn/k)x, + Y, 
when Ye K,. If p}mmn/k, then it follows that x, € K, and this implies 
Κ, ΕΞ K,. If we use Eqs. (3), we similarly obtain K, ¢ K,. If p +m, then (2) 
implies tr x = mx, + Y, Ye K,, and that yields x, ε K,, which in turn 
implies K, Ξ K,. 
Proof of Theorem 2 Without loss of generality we may assume that 


22 BOHUSLAV DIVIS 
p + mn, and hence we may assume that tr x = tr y = 0. By (2), we have 

xy = Dy, Lure v,EK (l<j<n). (4) 
For every g € G = Gal(K,(x,)/K(x,)) is 


J 


If we add up all these equations, we obtain 


[K,(e1):Keeal] = LL vob) = Lv ob) = Lv tr y=0. 


geG j=l j=1 


This implies 


P\[Ky(1): ΚΟ ΓΚ, ν: ΚΊ. 


Theorem 3 The condition p|[K,, ν: Κ] must be satisfied in every deficient 
system. If m>n and p > n, then p|m. In particular, p < max(m, n). 


Proof Thecondition p|[K,, ,: K] is a consequence of Theorem 2 and of 
the “divisibility preserving” property of every reduction procedure to a prim- 
itive system. The second statement also holds for primitive systems. 
Namely, in the proof of Theorem 2 we found that p 7 mn implies 
p|[K,(x1):K(x1)], and Theorem 1 says that K,(x1) = K, if p 7 mm. There- 
fore, we must have p|[K,:K]|n!, which is tantamount to p <n, and as a 
consequence of the divisibility preserving property of every reduction 
procedure, it holds for every deficient system. 


Theorem 4 If one of the parameters m or n of a deficient system is a prime 
power, then p|mn. 

Proof It suffices to consider primitive systems only. If n= g’ and 
p + mn, then we have Eq. (4) as in the proof of Theorem 2. As a next step, 
however, we shall not apply the full group G = Gal(K,(x,)/K(x,)) to it but 
merely its Sylow q-subgroup G,, say. It is well known that G, is transitive on 
the y’s and therefore does not depend on j. It follows that 


[641 x1 = Σ Σ v,0(y,) = Σ v; » φίνὴ Ξ 9, 


geGq j=l jJ=1 @peG, 
which is a contradiction. 
Theorems 1-4 have been first proved by Isaacs [1]. The proofs given here 


are not essentially different, but they seem to be using simpler language. 
Also, they differ in the concept of a primitive system. 
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Now we would like to propose the following 
Problem Describe all primitive deficient systems. 


In this connection, it seems to be reasonable to introduce the following 
definition. A primitive deficient system will be called regular if k = ἡ < m, 
and irregular otherwise. 


Theorem 5 In a regular primitive deficient system with k =n <m is 
necessarily m = p*, where 1 <a<n— 1. 

Proof We have [K(x; + y,):K] =m and K(x; + γι, y1) = K(x, y1), 
and therefore the fields K(x, + γι) and K(y,) are linearly disjoint. If we have 
ΧῚ + Vy; =X2 + yz, Say, we have x, = x, + γι — y,. We can fix y, and send 
xX, + y, to an arbitrary sum x; + y; and then fix x; + y,; and send y, to an 
arbitrary y,. This shows that every expression x; + y; — γι 15. among the 
conjugates of x,. It follows that every expression 


Da iy) (3 4)=0. 0<4<p—1) 
j=1 j=l 


is a conjugate of x,. Conversely, by (2) every x; can be written in such a way. 
If we denote by « the number of linearly independent conjugates of y,, then 
m= p* (elements y;, y2, ..., y, are called linearly independent if 
Σ7.. Ay, € Καὶ Aj ε Z,> 4, =A, τὸ "-- τὸ 1, =0). 

The primitive systems can be described by the equations (1) satisfied by 
the conjugates x; + y,;. Two systems will be considered the same if the 
equations differ by a permutation of the conjugates x,,x,,...,x,, and γι» V2>5 
-.+» Y, only. By abuse of language we shall refer to these systems of equations 
as to the primitive systems, regardless of their algebraic representation. We 
shall write for brevity i j = k | instead of x; + y; =x, + yy. 

If 2=n<m, then k = 2, and thus m = p by Theorem 5. We have one 
regular primitive system for every p with equations 


1322 
2.13Ξ3.0 
p l= 12. 


If3 =k =n <m, then m= p’ or p. For every p, we have one regular system 
with m = p*. The equations can easily be written down. For p = 3 and for 
every p = 1 (3) there is one regular system with m = p. Let us remark that 
for p = 3 is tr y= 0 if m = 3, and tr y + 0 if m = 372. 

If 4 =k =n<m, then m = ρ᾽, p’, or p. For every p, there is one regular 
system with m= p°. If m= p?, there is one regular system for p =2 and 
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= —1 (4) and two regular systems for p = 1 (4). Namely, for every p there 
is the possibility γι + y2 = y3 + y4 = 0, and for p = 1 (4) we also may have 
γε =Ay, + (A— 1)y2, where A? = —1. If p= 1 (4), then there is also one 
regular system with m = p. 
Of particular interest are the irregular systems. There 1s just one such 
system for 3 =n <™m, with k = 2, m= 6, and p = 3. The equations are 


11.Ξ ὺ 41 -- 53 12=43 
21=3.3 51=62 32=63 
31-42 61=13 52 = 2 3. 


For 4 =n < m, there is also just one irregular system, with k = 3,m= 6, and 
p = 2. The equations are 


11=22=33 41=24=63 
21=12=44 42=14=53 
31=13=54 34=51=62 
32=23=64 43=52=61. 


Although the proofs of the last statements are rather tedious, they are not 
difficult in principle and will be omitted here. 

From these examples it might seem that mn/k is always a p-power. 
However, this is not the case. Namely, for n = 6, m= 10, k = 4,and p=2 
(mn/k = 15) we have the irregular system 


11=22=33= 44 16=62=83=104 34=43=56= 65 
12=21=55= 66 23=32=96=105 35=64=71=102 
13=31=75= 86 24=42=76= 85 36=54=81= 92 
14=41=95=106 25=51=84=103 45=63=82= 91 
15=52=73= 94 26=61=74= 93 46=53=72=101. 
Although we could construct infinitely many of such examples, namely 


m= 22:1... DF n= 27**+!~ 9 κτ 23, p =2, 


goo ἘΠ; pS Ny 2535 00% 


we were as yet unable to construct a system with p 4 mn. However, we can 
prove the following strengthening of Theorem 3. 
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Theorem 6 If m>nand p}mn in a deficient system, then p < 5n. 


Proof Let us suppose that $n < p <n. By Theorem 3, p must divide the 
order of the group Gal(K,(x)/K(x)) and therefore the group must be primi- 
tive on the y’s. Since it contains a p-cycle, the group must be 
(n — p + 1)-transitive. If we have x, + y,; ΞΞ Χῃ + 72, Say, then there exists a 
transformation that keeps x, and y, fixed and sends y, to an arbitrary y, 
(j # 1). Therefore k = n in contradiction to Theorem 5. 


Corollary If p } mn in a deficient system, then min(m, n) > 10. 
Proof This follows from Theorems 4 and 6. 


Now, let us study the products xy instead of the sums x + y. A first 
information can be obtained from [2]. As a matter of fact, the following 
theorem is contained in [2] although it is not explicitly stated there. 


Theorem 7 Let Καὶ be a field, 0¢ X <K,0¢YcCK, |X| =m, and 
| Y| =n. If there is no element z € K represented uniquely as z = xy, x € X, 
and y € Y, then there are two natural numbers e and f and two pairs of distinct 
elements X,, X, € X and y,, y, € Y such that 
a a and =e < min(2"*"~?, n™~') 
and 
γί = γῇ and [{-:- min(2"*"~ 2, m""!). 


The definitions of multiplicative deficient and primitive deficient systems 
would be analogous. We can prove 


Theorem 8 Let K, K(x), K(y), K(xy), and K(x, y) form a primitive 
(multiplicative) systems with [K,. ,:K] = g. Then x8 € K and y%e K. 
Proof Instead of Eqs. (2), we have 


x=Xi[] yp YAy=O0 (1 <is<m). (5) 
j=1 


N(x)= xT] yf, A= 
ji 


j 
If we apply all g € G = Gal(K,(x,)/K(x,)) to it and multiply the resulting 
equations, we get 


(Ge) = xt T] 


)Ξ 1 


(Tow. )) "= x 
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since K,(x,)=K,, by (5), [],<c (yj) does not depend on j and 
δὴ, A; = 0. The proof of y’ ε K is analogous. 


Let us remark that Theorem 6 does not hold in an arbitrary multiplica- 
tive deficient system. 

As in the additive case, we can define the regular primitive multiplicative 
systems by the condition 1 « Καὶ =n <m. Let us prove 


Theorem 9 In a regular primitive multiplicative system we have x” € Καὶ 
and y" € K. 


Proof The conjugates of x and y satisfy a system of m equations of the 
type 


Xi, V1 = XigV2 = = Xin 


If we multiply all of them combining terms containing the same y’s, we 
obtain 


N(x)yt = N(x)y2 = :-Ξ- N(&X)yn 


which implies γῆ = γῆ! = --- = y™ € K. Instead of Eqs. (2) we have Eqs. (5). If 
we raise them to the mth power, we get 


x™=x™ L<i<m, 


thus χε K. 
In difference to the additive case, we can prove 


Theorem 10 ΑἹ] primitive multiplicative systems are regular. 


Proof Let us assume that we have a primitive multiplicative system that 
is not regular. We can easily see that x” € K would imply the regularity of 
the system. Let m, > m be the minimal positive exponent such that χε Καὶ 
and y"! εἰ K. We may assume that we have a system with a minimal value of 
m and then also with a minimal value of m,. By Theorem 8, m, is not 
divisible by primes larger than m. Let p < m be such that p|my. Let us put 
X,;=x?, Y,=y? ἴογ 1 <i<mandi<j<n. The expressions X; cannot be 
all equal since otherwise x? € K. Therefore, K(X;) is a nontrivial extension 
of K of degree at most m. Since the equations for x; and y, are linked, the five 
fields K, K(X;), K(Y;), K(X1 1), K(X1, Y,) form a multiplicative deficient 
system. Finally, ΧΡ e K. Thus, the parameters of this new deficient system 
are certainly smaller than the parameters of the old system. Therefore, this 
new system must be regular. 

Conversely, let us start with this new regular system and put x? = Χ. 
This would force the old system to be imprimitive, which is a contradiction. 
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Indices in Cyclic Cubic Fields 


D. S. DUMMIT* H. KISILEVSKYt 
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PRINCETON, NEW JERSEY PASADENA, CALIFORNIA 


If K is an algebraic number field and O, the algebraic integers in K, let Ind(«) = [O,: Z[a]] for 
a € Ox. If m(K) = min, . 9, Ind(a), it is shown that m(K) is unbounded as K runs through cubic 
subfields of the cyclotomic fields of Ith roots of unity for | prime and that there exist infinitely 
many cyclic cubic fields K with m(K) = |. We also compute m(K) for some specific examples. 


Let Καὶ be an algebraic number field and let Ox be its ring of integers. 
Given « € Ox, define the “index” of «, written Ind(«), to be the group index 
[Ox: Z[a]]. This index is finite if a generates K over the rationals Q, and « is 
said to generate a “power basis” if this index is 1, 1e., Ox = Z[a]. If ὦ; = 1, 
W,..-, ὦ, iS an integral basis for Ox, then the index of α is the absolute 
value of the determinant of the matrix transforming the basis w,,..., ὦ, to 1, 
α, αὖ, ..., αὐ ς If d(x) = [ ]o.41 (α — o,(a)) (where the product is taken over 
all nontrivial isomorphisms of K into an algebraic closure), then 
Normx)g(5(«)) = D(1, α, ..., αὐ ἢ) where D(a,, ..., ,) is the discriminant of 
{a,,..., %,. Hence 


Normx,9(6(«)) = Ind(«)? - D(@,, ..., @,) = Ind(«)? - D(K), 
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where D(K) is the discriminant of K. Let 
= Χὶ bX We eX, Oye 
Since Ind(«) = Ind(« — x,), we may assume x, = 0. Then 
αἰ τὸ Lt +Sin Op 


where the f,, are forms of degree i in X),.--,X,- Hence the determinant of the 
transformation matrix is a form of degree 3n(n — 1) in the n — 1 variables 
X5, .+., X,- This form is called the “indicial form,” and we shall denote by 
m(K) the minimal value assumed by the form as « ranges over the integers of 
K. Ox has a power basis if and only if m(K) = +1. 

The question of the existence of a power basis was originally examined 
by Dedekind [4]. It was demonstrated that the ring of integers in the cubic 
field generated by a root of x* + x? — 2x +8 has no power basis. The 
Dedekind example led to the following result of Hensel [11]: 


Proposition The index of every element « € Ox is divisible by the prime p 
if and only if 


1 " 
g(f) >= a 5) 
> 5 Ba 
for some integer f > 1, where g(f ) denotes the number of prime ideals dividing 
(p) in K with residue class degree f (such primes are called “common index 
divisors” ). 


We shall call the case [K : Q] = 3 of this result the Dedekind test. We shall 
be concerned with determining the minimal index for cubic subfields of 
prime cyclotomic fields. 

The cyclotomic field of /th roots of unity K, contains a (unique) cubic 
subfield K if and only if | = 1 (mod 3). Let / be such a prime, and ¢ any 
nontrivial solution of x! = 1. Then Καὶ, = Q(¢) and the galois group g(K,/Q) is 
cyclic of order | — 1, generated by the automorphism τ: ¢ — C9, where g is a 
primitive root mod |. The cubic subfield K is then the fixed field of the 
subgroup <t°>. 

If we denote by « the relative trace from K, to K of C, then « is a generator 
of the field K, K = Q(a). This follows from the fact that is fixed by {τὴν and 
cannot be rational since fewer than / — 1 of the elements ζ, 67,..., ζ| 1 are 
linearly independent. 


Proposition 1 The element « = Trx,x(¢) forms a normal basis for Ox, i.e., 
the elements a, a(x), o?(a) (the K-conjugates of «) are an integral basis for Ox. 
Proof It suffices to show that each y € Ox can be written in the form 
γ = aa + bo(a) + co?(a) with a, b, c € Z. Since « generates K, y € Ox implies 
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y = ra + 5σ(α) + to?(a) where r, 5, t € Q. As also γε Ox,, considering this 
expression in K, and noting Ox, = Z[¢] shows r, s, t € Z. 


To determine the arithmetic of the field K, it is convenient to find the 
minimal polynomial satisfied by a. The solution to this problem is afforded 
by the theory of cyclotomy. 


Proposition 2 Let | be a prime = 1 (mod 3). Then 4I has the (unique) 
representation 4] = c* + 27d’, with c = 1 (mod 3), d > 0. With respect to this 
representation, the minimal polynomial for ἃ = Tr,,9(C) is 


ἰ-- Ἰ Ic+ 31-1 
f.{x) = x3 + x? = (=) = [533 


Proof The number a is a “cyclotomic period” of degree e = 3 (,.6., 
a= δ᾽ σ((), σι € <t*>). The period equation satisfied by a is [5, p. 393] 


l+a l 1 
(1,0)e (1, 1) -- α (1, 2) 
(2, O)a (2, 1) (2, 2) -- α 


where the (i, 7) are the “cyclotomic numbers” associated with e = 3. These 
numbers can be computed in terms of |, c, and d, where 4] = c? + 2742. 
c = 1 (mod 3). From Storer [14], 


18(1, 0) = 21 -- 4 -- ς -- 94 
18(1, 1) -- 21-- 4-- ς - 94 
911,2)=l+c+1 

(2, 0) = (1, 1), (2, 1) = (1, 2), (2, 2) = (4, 0). 


Solving the determinant with these values yields the polynomial f,(x). 

The polynomial f,(x) determines the splitting of all primes in K. The 
prime ἰ ramifies. Since discriminant f,(x) = [242 (an elementary calculation), 
the splitting of all primes p 7 d is determined by the splitting of f,(x) in Z/pZ 
[12, pp. 27-29]. If now p|d, then [242 = 0 (mod p), hence f,(x) splits com- 
pletely mod p. Thus ἃ, (a), σ᾿(α) € Z/pZ. But since «, σ(α), σξ(α) generate 
Ox over Z, ἃ, σ(α), σξ(α) generate Ox /p over Z/pZ (pa prime lying above p). 
Hence f(p/(p)) = 1, and p splits (completely) in K. Thus the splitting of all 
primes is determined by f,. 

The theory of cyclotomy also allows ready computation of the indicial 
form. Using the integral basis 1, «, o(«), and using the relations among the 
periods in Dickson [5, p. 393], we have 


= 0 


B=xa + yo(a) 
B? =foy (1 +fo2 χα Ὁ [23 σ(α) = Ax + Bo(a) + Co?(x) 
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where 
A = (0, 0)x? + (0, 2)y? + 2(0, 1)xy — (p — 1) 
B = (0, 1)x? + (0, 0)y? + 2(0, 2)xy — (p — 1) 
C = (0, 2)x? + (0, 1)y” + 2(0, 0)xy — (p— 1). 
Thus 
1 0 0 
1πη4(β) = + 0 x y 
—C A-C B-C 


Computing the determinant with 
(0,0) =!—8+¢, (0, 1) = (1, 0), (0, 2) = (1, 1) 


we obtain the indicial form. We summarize the results in the following 


Proposition 3 Let B € Ox. Then the index of B is given by 
—c— 3d d 
+Ind(B) = dx? + [-5:.» γῈ [ “τ ὅς γ»ν + dy° 


where 4] = c? + 2742, c = 1 (mod 3), d > 0, and B = xa + yo(a). 


This representation for the indicial form gives immediately an upper 
bound for m(K). For x = 1, y=0, we have Ind(«) = d. Hence m(K) < 
(41/27)'!?. 

The question of the existence of a power basis is reduced to considering 
whether this form represents 1 (note that if the form represents +1 then it 
represents 1). Direct numerical tabulation of values represented by this form 
allows us to determine power bases in some cases, for example | = 241 
(x = —1, y=2), 373 (x = —2, y=3), 379 (x=-—1, y=3), 463 (x= 
—ly= 2), 751 (x = —1, y = 3). 


Proposition 4 2 splits (completely) in the extension K if and only if 2 
divides d. 

Proof Since a, σ(α), σξ(α) are a Z-basis for Ox, their images under the 
map Ox,—Ox,/p generate Ox/p over Z/pZ. Suppose 2 divides d. Then 
discriminant( f,(x)) = 0 (mod 2), so f,(x) has a multiple root mod 2, hence 
splits completely mod 2 (a cubic having a multiple root has all its roots in 
the ground field). Thus ἃ, o(«), o7(a) € Z/2Z, i.e, [Ox /p2: Z/2Z] = 1. Hence 

f (p2 /(2)) = = 1, and 2 splits in K. Suppose now that d is not divisible by 2 and 
2 splits in K. Then f,(x) must split completely into distinct linear factors 
mod 2, clearly impossible. 
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Theorem 1 (Dedekind) If d =0 (mod 2), then no power basis exists, 
and in fact every index is divisible by 2. 


Proof This follows immediately from the Dedekind test and the above 
proposition. We may also see this directly from the indicial form as follows: 
4] = c? + 27d? implies c and d have the same residue class mod 2. Hence if d 
is even, c is also. Writing the indicial form as 


2x5 NSD 2 2 
4(χ 4. γ᾽)-- {55:5} Ξ sa} 


and noting x?y + xy? = x*y — xy? = 0 (mod 2) for all values of x and y, we 
see that the form is always even. 


Remark By Hensel’s result above, 2 is a common index divisor in cubic 
fields if and only if 2 splits completely in the extension. By a result of 
Engstrom [6], 2’ is the only possible common index divisor in cubic fields, 
e.g., 4 never occurs as a common index divisor. References [2, 4, 6, 9, 15] yield 
a fairly complete history of the results on common index divisors. 


Lemma 1 Let p be a prime = 1 (mod 3) and let p be a primitive third 
root of unity mod p. Let a be any nonzero residue mod p. Suppose α΄ '‘d is a 
cubic nonresidue mod p and that c = (6p* + 3)d mod p. Then the indicial 
form does not represent a. 


Proof For c = (6p? + 3)d mod p, the indicial form becomes 
f (x, y) = dx? + 3pdx?y + 3p7dxy? + dy” 
since 1 + p = —p” mod p. Hence, 


I (x, y) = d(x + py)* mod p 


and the values assumed by the indicial form lie in the same cubic residue 
class as d. Since a and d lie in distinct cubic classes by assumption, a is not 
represented by the form. 


Using Lemma 1 we are able to construct an infinite class of fields that do 
not satisfy the conditions necessary for the Dedekind test but which never- 
theless do not have a power basis. This set of fields is constructed in such a 
way that the minimal index m(K) is unbounded as K ranges over the set. 
That the minimal index is unbounded over the set of pure cubic fields is a 
result of Hall [9]; the following shows that the same is true for the set of 
cyclic cubic fields. We shall need the following special case of the generalized 
Dirichlet theorem. For the proof as a consequence of the Tchebotarev den- 
sity theorem, see Bass et al. [1]. 
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Theorem Let F be a number field. Given nonzero a, b € Οε and a nonzero 
ideal M such that aO,; + bO,;, = O; = UA + bO;,, there exist infinitely many 
primes p such that p& = mO; for some m = a (mod bO,). 


Theorem 2 Given any N > 0, there exists a cubic subfield Καὶ of a prime 
cyclotomic field K, = Ο(ζ), ¢ a primitive Ith root of unity, such that m(K) > N 
and 2 remains prime in Ox. 


The condition that 2 remain prime in Ox, by Proposition 4, implies that 
the Dedekind test for the nonexistence of a power basis does not apply, 1.6., 
there is no common index divisor. 


Proof Let p,, ..., py be distinct primes = 1 (mod 3). Let p; be a primi- 
tive cube root of 1 mod p,. Choose d; so that i-* - d; is a cubic nonresidue 
mod p;, and let οἰ = (6p7 + 3)d;. By the Chinese remainder theorem, let c 
and d be solutions to the following system of congruences: 

c =dz= 1 (mod 2) c =c, (mod p,) 
c = 1 (mod 3) 3d = d; (mod p,). 


Let now F = Q(,/—3) in the Dirichlet theorem above, and set 


θᾶ Ν 
gf t Xv vee, b=6|[p;, W=O,. 
i=1 


Since 


c* + 27d? (69? + 3)? +3 
Norm,,g(a) = ae et d? [3:5 -- 


mod p, = — 6d? mod ρ; 
and d, # 0 (mod p,), it follows that (a) is relatively prime to ([ [ἢ 2 p;). Also 
Norm,/g(a) = 1 (mod 6) 

so (a) is also prime to (6). Thus 
(a) + (b) = WU + (δ) =O,;, 


and by the Dirichlet theorem, there exists a prime p such that p = (m), where 
3d. /— ᾿ 
n= “ΤῊΝ 9 mod( 6 I] ri) 
i=1 


If the prime p were of degree 2, then p = (p) = (m), where p is a rational 
prime. Hence p ~ m in O,. The units of F are +1, +@, and +w?, where 
w = 4(1 + ./—3). Hence we would have 


(‘ + ae τό Πρ + ny — 3) 


i= 


INDICES IN CYCLIC CUBIC FIELDS 35 


with x, y € Z, and such that 
m= +p, tpw, +pw’. 


The first is impossible since 
N 
3d + 6y| | p; = 1 (mod 2), 
[Ξ1 


so m cannot be real. The latter two cases would imply 


+p+p/—3 
m= ————_>—_ 
2 
which is also impossible since 


N N 
εἰ. + 6x[ To] = (34 + ΤΠ) 
i=l i=1 


is impossible mod 3. 
Thus the prime p is of first degree, so 


(6+ SLs pra) Cds 611 εν) J=3) _ 8 +270 


= Normrig 5 


is a rational prime with 
s=t=1 (mod 2) 
s = c; (mod ρ)), t = d; (mod p;). 

Let K be the cubic subfield of the prime cyclotomic field K, for this ἰ. The 
numbers 1,..., N do not occur as indices by Lemma 1, and 2 remains prime 
in K by Proposition 4. 

The indicial form and the integers it represents can also be connected 


with a certain norm form in the cubic field K. Letting B = χὰ + ya(a), we 
know 


D(1, B, B’) = Ind(B)’ - D(K), 
or since D(K) = 1’, 
Ind(B)’ - I? = N(6(B)). 


The prime ἰ is totally ramified in K, and the prime sitting above / in K is 
principal, say 


pao) 
(y can be taken to be the relative norm from K, to K of 1 — ¢). Since / is 
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totally ramified, we can also write 
[2 = Normg,o(y)* = Norm(yo7(y)). 
Letting δ(β) = (B — o(8)(B — 07(8)), we have 
, _ | Norm(f — σ(β)) β — o°(8)) 

In = emo 
Norm (>= 2) - σῷ } Norm( > ol ) , 

γ o*(y) 
Applying o to the second norm, we find that 


π balan τι}, 


Norm 


so that 
Ind(f) = Normyyo(>— 7) 


Note that as / is totally ramified, the inertial group of p = (7) is the full 
galois group, so β = σ(β) mod » for all σ. Hence the element (β — o())/y is 
an integer. We write this as 


Proposition 5 Ind(f) = | Normx,o(&)| where € = (B — o(B))/y € Ox. 


By this proposition, the existence of certain indices implies certain rela- 
tions in the class group of K. In particular, in the case of prime indices, we 
have the following result. 


Proposition 6 Suppose the prime p # lis an index in K. Then p splits in Καὶ 
into three principal factors. 

Proof By Proposition 5, p= Norm(é) for some ¢€ € Ox. Since p is 
unramified in K, p = p, p2 p3, with f (p; /(p)) = 1. Further, (ζ) = p, for some 
i, is principal, and since the p; are conjugate under the Galois group, each p; 
is principal. 


If the structure of the class group is known, Proposition 6 is useful in 
eliminating certain possible values for m(K). For example, if | = 277, then 
m(K) = 2 or 4. Determining the splitting of all primes p < ὁ] to determine 
the class group structure (the constant 4 is due to Davenport [3]), using 
f,Ax) = x3 + x? — 92x + 236 (see the comment following Proposition 2), 
shows that only 2, 13, 19, and 37 split in K. Since Norm(« — 4) = 27 - 13, 
Norm(a« — 8) = 2? - 19, and Norm(a — 5) = 2 - 37, we have 


Cx = <([Po], [p2)> 


where Cy, is the class group of K and [p,], [p2] are the classes of two 
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(conjugate) primes lying above 2 in K. Since the class number of K can be 
computed by analytic techniques, and h(K) = 4 for this field [8], p, cannot 
be principal. Hence by Proposition 6, 2 is not an index, so m(K) = 4. Similar 
calculations hold for the prime | = 397, but for all other undecided m(K), 
Ι « 1000, h(K) = 1, so Propositions 5 and 6 are of no help. 

We remark that the converse of Proposition 6 does not always hold, for 
example in the case | = 499, where h(K) = 1, and 2 splits into necessarily 
principal factors, and yet 2 is not an index (congruence considerations of the 
indicial form mod 9). 

Let now n be a square-free integer, not necessarily prime. If 
n = 1 (mod 3), then the cyclotomic field K,, has a cubic subfield K of discri- 
minant n?. By Hasse [10], the element « = (—1)’** Trx,x(¢) generates a 
normal basis for O,, where r is the number of distinct primes dividing n and 
ζ is a primitive nth root of unity. Further, « has minimal polynomial 


κα a 


where 4n = c? + 27d”, c= 1 (mod 3) as in the case for n prime. 


Theorem 3 There exist infinitely many cubic subfields K of cyclotomic 
fields K,, for which m(K) = 1, i.e., a power basis exists for the ring of integers in 
K. 

Proof The index of « defined above is given by (D(f,(x))/D(K))'/?, and a 
straightforward computation of D(f,(x)) shows « has index d. Hence it 
suffices to show there exist infinitely many square-free integers n such that 
4n = c? + 27, or equivalently, n = k? + k + 7 (see Narkiewicz [13]). Let N, 
denote the number of square-free integers of the form k? + k + 7 for k < x. 
Then N, > x — M,,, where M,, denotes the number of integers of the form 
k? +k+ 7 for k <x and having a square factor. Hence, 


M,, = card{k < x|k? +k + 7 =0 (mod p’) for some prime p}. 


We may clearly take p < x. Since the congruence k? + k + 7 =0 (mod p’) 
has at most two solutions on any residue system except for p = 3, we have 


M,<(3-5+3) + » (2-3, +6,] 
9 Ss psx Ρ 


where θ, < 2. Hence 


N,2x(3-2 a) ~201=Cx + ofx) 


p25 psx 


Since C= $4-—2).,,5 p 7 = 0, we have N, — 00 as x — 00, concluding the 
proof. 
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The determination of m(K) for any particular case reduces to solving a 
finite number of cubic Diophantine equations, namely f(x, y) = a, where 
f(x, y) is the indicial form and a is a positive integer less than (41/27)!/2. In 
conclusion, we determine m(K) for the cubic subfield K of K,,,. We let N(A) 
denote Normx,9(A). 

For | = 181, we have c = 7, d = 5, and the indicial form for this field is 


f(z, y) = 529 — 112z?y — 4zy? + Sy?. 


The prime 3 remains prime in this extension, so by Proposition 6, cannot be 
an index. Further, for d odd, m(K) is also odd. (Proof: The indicial form 
mod 2 for d odd is either x° + x?y + γ᾽ or x° + xy? + γ᾽ since one of 
—c — 3d and c — 3dis = 0 (mod 4) and the other is = 2 (mod 4). Hence, an 
even index is represented only for x=y=0(mod 2). But then 
(χα + yo(«))/2 is an integer with smaller index, a contradiction. Hence m(K) 
is odd.) Thus m(K) = 1 or 5. We shall show m(K) = 5 by showing f(z, y) = 1 
for integers z and y is impossible. 
Assume f(z, y) = 1, for z, y ε Z. Then letting x = 5z, we have 


25 = 25. f(z, y) = x3 — 11x?y — 20xy? + 125γ5. 


By Proposition 5, f(x, y) is always reducible in K[x, y], and in this case we 
have 


(25) = (x — By)(x — o(8)y)(x — o7(4)y) (1) 
where θ = a + 4. Define p, by . 
(a — σ(α)) = Ps Piss 


(recall Ind(«) = d = 5). Since N(@) = — 125, (@) is divisible by three primes 
above 5. Also Νίθ + 5) = —5* - 7. If (8) were divisible by the three distinct 
primes above 5 in K, then (5)|(@ + 5) (| denotes divisibility), which would 
imply 5° | N(@ + 5), a contradiction. If (θ) were the cube of a prime above 5, 
then Ν(θ + 5) would not be divisible by 52. Hence (θ) = αἱ ας, where ας, ας 
are two distinct primes lying above 5 in K. Since (6 — σ(θ)) = (a — a(a)), we 
have by definition of ps, ps [{θ — o(8)), and considering the possibilities for 
ας and q;, we see that only (0) = pz p; and (6) = (p$)*p, are possible, where 
p, = a(ps) and p% = o7(ps). In either case, since 5|x, we have 


Ps Ps |(x — yO) 
and since N(x — γθ) = 25, 
(x — γθ) = ps Ps - (2) 


An integral basis for this field is given by a, o(a), and o7(a), or in terms of 
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the computationally easier 1, «, «’, 


8 — 2a — a? — 43 -- 5 
πο a 
5 5 
There are two conjugate fundamental units for this field, given by 
26 + 21a — 2a? 54 — 16a — 3a? 
τ a ce Ce 


(these units were originally calculated by the method of Godwin [7], and are 
the same units given in the tables of Gras [8]). 
We may calculate p, explicitly from (a — o(«)) = ps pig, to be 


-[Ξ 333] 
ps = 5 


with conjugates 
, (7+ 7Ta+ a7 , (126 + a — 2a? 
= — Pr Ay —— ea 


Hence, from Eq. (2), we have 
x — yO = et e™(4)?(2 — 8a + u7)(126 + α — 207) 
x — γσ(θ) = εἰ "6 ™(4)7(7 + Tx + «?)(2 — 8a + αὖ) 
x — yo?(0) = εἴ "ε5 "(4)?(126 + α — 2u7)(7 + 7α + a7) (3) 


where we have used the fact that δ, and 82 are conjugates to obtain the 
second two equations, and where m and n are integers. 

We shall show the equations in (3) are impossible by considering them 
first modulo p, (a prime lying above 7 in K) and then modulo p,q (similarly 
a prime lying above 19 in K). Note that Νία — 2) = 5* - 7 and N(a — 7) = 
5-19, so that both p, and p,, are first degree primes in K, which may be 
defined as the kernels of the homomorphisms Ox > Z/7Z (by «— 2) and 
Ox Z/19Z (by a -- 7), respectively. 

Modulo p,, we have, since « = 2 (mod p,), 


€, = 5 (mod p,), 8 = 2 (mod p,) 


(’ 2a EE) 5 miedp) 
[55:55] - 5 (mod p,) 
(125+ 2-28") = 5 (mod νη 


06=a+4=6(mod p,), σίθ) =3 (mod p,), 
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and 
o*(0) = 2 (mod p,). 
Hence Eq. (3) becomes 
x + y = 5"2" = (—1)"2"*™ (mod p,) 

x —3y=57-"2"—-™4 = (-- 1)525- 24 (mod p,) 

χ-- 2γξ 55 52. "Ξ (-1)"*"2"" 7" (mod p,). (4) 
Then 


ἘΠ" x — 2» Ἢ 
ara a (mod p,), x 3y (-- 1)" - 2 (mod p;) 


imply that m is odd and n is even (m even implies y = 0 (mod 7), and the 
indicial form does not represent 1 for such y, and m odd implies 
x = 4y (mod p,), which shows (x — 2y)/(x — 3y) =2 (mod pj), ie. n is 
even). 
We now consider Eqs. (3) mod p,,, where « = 7 (mod p,9). We have 
here 
δι =15 (mod pig), = &. = 16 (mod ρΡ,9) 


[Ξ5-- 8a + o +) = BiGuod ais) 


(7+ 74") 2 


2 (mod pj9) 


126 ἐδ: a -- 20 
== 5:8 56) -τὶ = 7 (mod p49) 


θ = 11 (mod pjo), Ps = —1 (mod pj), o*(8) = 1 (mod po). 


Here we have 
x — lly = 15"16"12 (mod p,9) 


x+y=15°-"16"~ "17 (mod ρΡ,9) 
x — y= 155. 516. "14 (mod pyjo). 


Since n is even, m odd, set n = 2k, and m= 2k, + 1. Then the first two 
equations become 


x — Ly = (157)*(16)"* - 2 (mod p19) 
x + y = (157) (16)"" "2 - 3 (mod pio). 


Now, 2 is a quadratic nonresidue mod p49, as is 3 (the quadratic residues 
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127 
139 
151 
157 
163 
181 
193 
199 
211 
223 
229 
241 
271 
277 
283 
307 
313 
331 
337 
349 
367 
373 
379 
397 
409 
421 
433 
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-- 14 
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m(K) 


439 
457 
463 
487 
499 
523 
541 
547 
571 
577 
601 
607 
613 
619 
631 
643 
661 
673 
691 
709 
727 
733 
739 
751 
757 
769 
787 
811 
823 
829 
853 
877 
883 
907 
919 
937 
967 
991 
997 
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are 1, 4, 5, 6, 7, 9, 11, 16, 17), so both x — lly and x + y are quadratic 
nonresidues. Tabulating the pairs (z, y) for which f(z, y) = 1 (mod 19), we 
find that for no pairs are 52 — 11y and 52 + y both nonresidues. Hence the 
equations (3) are impossible, and m(K) = 5. 

Table 1 lists the indicial form coefficients (as dx* + Ax?y + Bxy? + dy’), 
and the known m(K) for | < 1000. In the cases where several indices are 
listed, they are the only possible values for m(K) but it is undetermined 
which is correct. Except for the case | = 181 above, these values were ob- 
tained by Proposition 6 (using the tables of Gras [8] for the class numbers) 
and by congruence considerations of the indicial form. Note that c has been 
normalized positive in the table and that this does not affect the values 
assumed by the indicial form. 

Finally, we wish to acknowledge the assistance of George Cooke for 
helpful suggestions and comments. 
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Spinor Genera under Field 
Extensions, III: Quadratic Extensions 
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It has been shown by the authors that any two integral quadratic forms 
in the same genus over the rational number field which are not properly 
spinor equivalent do not become properly spinor equivalent in any odd- 
degree extension of @ (see Earnest and Hsia [7]). But examples given in 
Earnest and Hsia [4, 6] have indicated that this type of behavior cannot, in 
general, be expected for quadratic extensions. It is the purpose of this paper 
to give constructive methods for determining the amount of collapsing of 
proper spinor genera in quadratic extensions of @. Additionally, several 
results are obtained giving information about the number of proper spinor 
genera in the genus of the lifted lattice. 

The vehicle used in studying the lifting behavior of spinor genera is the 
correspondence, first described by Kneser in [11], between the proper spinor 
genera and a factor group of the idéle group of the coefficient field. The 
noncollapsing of the proper spinor genera in odd-degree extensions can be 
established by showing the injectivity of the natural lifting map between the 
corresponding idéle groups. In the case of quadratic extensions, however, the 
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kernel of this map may be nontrivial and it is this kernel that we shall 
compute in Section 1. Of course, in order to determine the idéle groups 
involved, it is first necessary to compute the local spinor norm groups of 
the lattice under consideration. This can be carried out for the spots on 
the rational number field using techniques from Kneser [11] and Earnest 
and Hsia [5]. However, these spinor norms have not previously been 
determined for the dyadic spots on quadratic fields. While the general prob- 
lem of calculating the spinor norm groups over arbitrary dyadic local fields 
is extremely difficult, it is possible to use the strong local condition that 
the ramification degree is less than or equal to 2 to extend the results of Hsia 
[9] and Earnest and Hsia [5] to perform the necessary calculations for qua- 
dratic fields. This is done in Section 2. 

The third section of this paper shows several applications of the 
techniques of the first two sections. Examples of Z-lattices and quadratic 
extensions are constructed showing that the number of spinor genera which 
collapse may be any arbitrary 2-power. We give a condition sufficient to 
assure that, while distinct proper spinor genera from the genus of a Z-lattice 
L may collapse in the genus of the lattice L lifted to a quadratic extension, 
the number g*(L) of proper spinor genera in gen(L) is no smaller than the 
number g*(L) in gen(L). The condition is satisfied, in particular, when there 
is no rational prime that divides both the discriminant of the field extension 
and the determinant of the lattice. Finally, the computations appearing in 
Section 2 are applied to give sufficient conditions, in terms of the reduced 
determinant of L, for the number g*(Z) to be smaller than or equal to the 
order of the 2-Sylow subgroup of the ideal class group of the quadratic field. 
Of course, if the class number of the field 1s odd, this gives g* (L) Ξε 1. 

The techniques described in Section 1 are easily adaptable to the use of 
computer methods for the calculation of the kernel of the lifting maps. A 
sample of data obtained with the use of the IBM System 370 at the Ohio 
State University is contained in the first author’s doctoral dissertation [2]. 
Further results will appear in a future paper, written jointly with John H. 
Yang. 

We now recall briefly the notation established in our previous papers 
dealing with spinor genus behavior. Unexplained notations can be found in 
O’Meara [12] and Earnest and Hsia [6]. With the exception of Section 2, L 
will denote a Z-lattice and [= L @, 0 the lattice lifted to the quadratic 
extension E = Q(,/m) (here o denotes the ring of algebraic integers of E). 
The proper spinor genera in the genus of L correspond bijectively to the 
elements of the factor group J,/P,J,J,, where V denotes the quadratic 
space on which L is defined. The spinor norm map induces an injective map 


Pi: Jy/PyJyJ i ὐϑωΡρυᾷ = G,(L) 
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which is in fact an isomorphism when dim L > 3. The corresponding groups 
can be considered over the extension E and for any prime spot p on E lying 
over p, the lifting maps g: Jg— J, and h: Jy > Jp, defined respectively by 


(Glip))p=jip and (μί(σ,)), Ξ σ, ΘΕ, 
induce homomorphisms wy, and I, in the commutative diagram 


ΦΙ. 
Τρ Ρουρῦν —— Gr 


OL 
Jy/PyJyJ_, —— GQ(L). 


Upon determining Ker(w,), it is possible to relate directly back to the col- 
lapsing behavior of spinor genera by using the method outlined in Appendix 
B of [7]. 


1. Computation of Ker(w,) 


The problem of computing Ker(w,) is twofold. First, we need to identify 
a set of elements of J whose classes modulo Pp) J§ give a complete list of the 
elements of Yg(L) (since dim L > 3, this set is finite). Second, we need an 
effective way to determine for a particular j€Jg whether w,(j)= 
(ἢ ε Y,(L) or, equivalently, whether h(j) ε Pp Jz (h denotes the map of Jg 
into J, that induces y,). These two questions are closely related and will be 
treated with similar techniques. To determine whether j equals (1) in Yg(L), 
we need to check whether there is some element « of D for which αἱ € Jg. 
Since there are an infinite number of possible choices for such a, the problem 
is essentially that of finding some finite subset of elements of D that represent 
all possible cosets of the elements of D modulo J§. The second problem can 
be likewise reduced to finding a finite subset of elements of D that represent 
all possible cosets of elements of D modulo υξ. 

We deal first with the question of determining representatives for the 
elements of Gg(L). There are two subsets of the set S consisting of all finite 
rational primes; these are related to the lattice L and are important for this 
investigation. Denote the set of “exceptional” primes for L by 


&(L) = {p € 5: 0(0* (L,)) # u, Q?}. 


We distinguish also a subset of &(L) that consists of those primes that will be 
used as “multipliers”; let 


ML) = {p € S: 0*(L,)) ¢ u,Q2}, 
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Note that .@(L) < &(L) and that both sets consist of only finitely many 
primes. The significance of these sets lies in the following proposition. 


Proposition 1.1 (i) Let L be definite andj € J°,. Thenj = (1) € GQ(L)if 
and only if there is some m = | | ,¢ a1 P’® with δ, € {0, 1} for each p, for which 


mj, € W(O*(L,)) forall pe &(L). 


(ii) Let L be indefinite and j ε σὰ. Then j = (1) € YQ(L) if and only if 
there is some m= +| |, <1 P*”. with €, € {0, 1} for each p, for which 


mj, €9(O*(L,)) forall pe &(L). 


Proof of (i) (<=) For pe &(L), we have mj, ε 6(0*(L,)) by assump- 
tion. For all p discrete, p ¢ &(L), we have m € τι, and Jp Eu, BY choice of j 
and by the form of m. Thus, for such p, mj, € u, e u, Q2 = = 6(0 *(L,)) by the 
definition of &(L). So, mj € JG and j € PpJ&. 

(=) Since j= 1, there is some «¢€D for which of ευξ. Write 
a=|]|pesp, vp Ε Z. For any p ¢ ML), v, must be even since jp € Up and 


0(0*(L,)) Ξ τι ὦ2. So a can be rewritten « = [9 εκ pag where 
QO if v, even 
1 if, v, odd. 


If « = [],<.at p, then a'j ε JG as desired. 
The proof of (ii) is identical except that in the indefinite case P,) = Pg 
and so — 1 must be included as a possible factor. QED 


% eQ and ¢= 


Consider an arbitrary nontrivial element of YQ(L), say j for somej € Jg. 
The representative j may be changed via multiplication by an element of Pp 
without changing the class 7. So k=j where keJ%, is defined by 
k=[]|,es p %4?4)j. Thus, a representative of each element of Yg(L) can be 
chosen from σὰ. Furthermore, for j, Κα J, if j, ε k, ΟΣ for all p ε &(L), 
then j = k. Define the set δ (1) by &(L) = {p € &(L): 6(0 *(L,)) # Q}. So it 
suffices to consider only the square classes of the coordinates at the primes p 
in &'(L). Thus, a set of representatives of the nontrivial elements of Gg(L) is 
contained in the set of elements j € J3, satisfying 


._ fil if péé(L) 
Je \n, if, pe BL) (+) 


where, for p odd, A, is either 1 or A, (the fixed nonsquare unit of Q,), and A, 
is one of the elements 1, 3, 5, or 7. Moreover, to determine whether two such 
idéles represent distinct classes, it is sufficient to check whether there exists 
some m= +| [,- a1) P, with ¢, € {0, 1} and —1 is used only when L is 
indefinite, for which j, € mk, 0(O* (L,)) for all p ε &’(L). 


Pp 
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The procedure for determining a list of representatives of the classes of 
G (L) can now be outlined. After determining the set &’(L), we consider the 
list h,, ..., ἢ, of elements of σὰ described by (*). We wish to eliminate 
redundancies from this list to obtain exactly one representative for each 
distinct class modulo Pp) J§. Let m, = 1,m,,..., m, bea list of the integers of 
the form +[[,< κω P”, where εν € {0, 1} and —1 is used only when L is 
indefinite, which are pairwise distinct modulo JG. Then m,hy, ..., mh, are 
equivalent to distinct elements of the list h,, ..., h,. These elements can be 
removed from the list because m;h, = h,. Let u be the smallest index j for 
which h; has not been removed from the list; eliminate all the multiples 
m,zh,, ..., m,h,. Eliminating successively in this manner, a list of elements 
distinct modulo P,J% is obtained in a finite number of steps. 


Example 1.2 Consider the lattice L = <1, a”, 7a*>, where 
= 35% 19; 
Local computations show that 6(0*(L,)) = u, Q? for p #2, 3, 7, or 19, 
θ(Ο" (L3)) = Q3, 0(0*(L;)) = Q,, 0(0* (Ly9)) = Qj,, and 
0(0* (L2)) = Q, ; 
Thus, .@(L) = {2, 7}, &(L) = {2, 3, 7, 19}, and 6΄ (1) = {3, 19}. A list of pos- 
sible representatives of nontrivial classes of Yg(L) is formed by j, k, and jk 
where j and k are defined by 
ἬΝ if p=3 κ --[Δ» if p=19 
"πὴ if p#3; ἢ 1 p £19. 

The list of integers generating principal idéles which are possibly distinct 
modulo Jg is 1, 2, 7, and 14. Checking coordinatewise, we see that (2) = 
jk (mod JG) and (7) = (1) (mod J6). So jk = (ἢ and j = (2)k (mod J4) gives 
j =k. Hence, |GQ(L)| = 2 and j is a representative of the nontrivial class. 

We now turn to the question of determining for a particular nontrivial 
element j of Yg(L) whether or not ¥,(j) is a nontrivial element of G,(L). It is 
helpful to introduce subsets of the set T of all discrete prime spots of E, 
which correspond to the set &(L) and .@(L) in Q. Let 

&(L) = {p € T: 0(0* (L,)) # u, E? 
and 
M(L) = ἵν € T: 0(0* (L,)) ¢ u, ΕΞ). 


Remark 1.3 The set .4(L) can be determined from .@(L). Let p be a 
rational prime and p a prime spot on E with p|p. We make several 
observations: 
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(i) If p+disc(E/Q), then 6(0* L,))¢ u,Q> implies 0(0*(L,)) ¢ 
u, ΕΖ. Hence, p ε .M(L) implies p εχ 
(ii) If p is odd and p|disc(E/Q), then 0(0*(£,)) < u,E2; hence 
p¢ ML). ᾿ 
(iii) When p=2, p|disc(E/Q) implies 6(0*(£,))< u, ΕΖ unless 
E = Q(,/m) where m = 3 (mod 4) and L, is split by a sublattice of the type 
2’<é, 26>, r € Z, ε, 6 € u. This will be verified in Section 2. 


We shall now describe the square classes of those elements of D that have 
even order at all the discrete spots on E. If E is a real quadratic field, let ε 
denote a fundamental unit of E; if E is imaginary let ¢ denote the element — 1 
except when E = Q(./—1) or Q(./—3), in which case ε =i or ε = --ὡς, 
respectively, where ὡς denotes a cube root of unity. 


“eee” 


Proposition 1.4. Let « € E with ord,(«) even for all p € T. Let p,,..., p, 
be all the rational primes dividing disc(E/Q). Then 


t 
772 
ae +te™|] pik 
i=1 


for some ἡ, ε {0, 1}, i=0, 1,..., t. 

Proof Consider the ideal ao, = θ΄. Since ord,(x) even for all pe T, 
there is some ideal 8 for which 91 = 8. If B is principal, say B = βοε, then 
a = B?- y, ye ug. Thus, α € +E? for 49 = 0 or 1. If B is not principal, 
then % is a class of order two in the ideal class group of E. Every ideal class 
of order two can be represented by an ideal of the form ] Ti=s pt ἡ; € {0, 1) 
where p,, ..., p, are the ramified primes of E (for a discussion of this well- 
known result see, e.g., [10]). It follows that 


t t 
B= (ita and W= 82 = (Ie): 
i=1 i=1 
where p,|p; . 


So a = yt*(| [i-1 pi’) with y € ug; that is, « ε (+e")([]i_-, pt)E? for no = 0 
orl. QED 


Corollary 1.5 Let L bea Z-lattice with dim L > 3 and let « € E for which 
ord,(«) is even for all p € T. 


(i) If Lis indefinite, if m <0, or ifm> 0 and Nzgja(é) = +1, then 
t 
aeD ifandonlyif «ae £2 [Tore n; Ε {0, 1}. 
i=1 
(ii) If L is definite, m> 0 and Ng ale) = —1, then 


t 
ae D ifand onlyif aert [Πρ] η, € {0, 1}. 
i=1 
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Returning now to the computation of Ker(w,), we take a nontrivial 
element j of YQ(L). As in previous arguments: we may choose the representa- 
tive j from J§,. Ifa € Dis such that ah(j) ε Jf, we must have ord, (a) even for 
all p ¢ M( i), but the order may be arbitrary at those p in .@(L). So in 
addition to the elements a described in Corollary 1.5, we must now also 
consider those a possibly having odd order at some subset of the spots in 
MC). | 

For a set Q,,..., Ὡς of spots in .@(L) we wish to describe those elements 
a of E satisfying 


ordg(a) odd for i=1,...,s 


(**) 
ord,(«) even for ΡΞ Ὦ;, ἰΞΞ1..ὕ..5.Ψ 


Πα and β are two elements both satisfying the conditions (++), then ord,(«) 
is even for all p ε T. So the square classes of « and β differ only by a product 
of those elements obtained in Proposition 1.4. That is, in order to describe 
the square classes of elements satisfying («*) it suffices to obtain only one 
such element. The question of the existence of such an element can be settled 
by an evaluation of Hilbert symbols. 


Proposition 1.6 Let Q,,..., Ὡς be spots in M(L) with Q,, ..., Ὧν not 
inert in the extension E/Q. The ‘following statements are equivalent: 


(i) There exists an element a € E satisfying (++). 


5' 


(ii) + [14 € Nga(E) where ᾳ, 2 = Q; 1 Q. 


(iii) (+ Πα: m) τ +1 forall p € Ωρ. 
P 


fw) (4 Ila. m) = +1 for p|disc(E/Q) and p = 2, 41, ...» 4s - 
Ρ 


i=1 


Proof (i)=> (ii) Let α satisfy (**). Using the formula 
ord (Ν ε,α(α)) = Σὺ [Ρ) ord,(«), 


it follows that ord,(N¢g/q(«)) is even re those p for which ΡΞ q;for anyi = 1, 
.., 5΄. So consider p = 4; for some i= 1, ..., s’. If p ramifies in E, then 

f(p|p) = 1 and ord,(Ng/q(«)) is odd. If p splits; in E, let p, § denote the prime 
spots lying over p. First, if p = Q;, § = Ὦ; for some i, j = 1,..., s’ with i # j, 
we get ογά (Ν, α(α)) = ordg(«) + ordg ω: is even since both orde(a) and 
ord, (a) are odd. On the other hand, if p # Q, for any j =1,..., 5. then 
ord (Ν εἰα(α}) is odd since ord,(«) is odd and ord, 5(a) is even. Thus, we can 


50 A. G. EARNEST AND J. S. HSIA 


write ΝΕ ο(α) = +] [ps p*? where δ 1s odd for those p for which p = q; for 
some i = 1,...., δ' and p # q; for any i # j, and ¢, is even for all other primes 
p. The element β has the desired norm where β = ya with y defined by 


(ε, — 2)/2 if D> 4, ΞΞ 4): ix j 
y= [|p = with v,={(e,—1)/2 ἢ e, is odd 
peS 


5} otherwise. 


Pp 
(ii) => (i) Suppose βε E with Ng α(β) = ἘΠΠΣΞ, ᾳ. Distinguish a set U 
of primes by the following: If p = q; for somei = s’ + 1,..., sand ord,,(f) is 
even then pe U; if p=q,;=4q, for i#j and ordg({f) is even (and thus 
ordg (β) is also even), then p € U; otherwise p ¢ U. The desired element « 
satisfying (*«) is given by a = τβ where 
: O if p¢éU 
a ΤΙ ᾿ς ΡΤ] καὶ pew. 

(ii)<> (iii) Let y denote the element +[]f_, 4;. Then y € Ng/g(E) if and 
only if γ is represented over @ by the quadratic space [1, —m]; that is, 
[1, —m] = [y, —ym]. By the classification of rational quadratic forms this 

Q 


isometry condition is equivalent to the equality of the local Hasse symbols 
S,([1, —m]) = S,([y, —ym]) for all p € Qg. A calculation of these symbols 
shows that this condition is in turn equivalent to the condition (y, m), = +1 
for all pEQg. 

(iii)<> (iv) These conditions are equivalent because (y, m), = 1 for all 
odd p that do not divide either m or y. QED 


For the purpose of the present problem, we consider those elements 
satisfying the conditions (*«) where the set Q,, ..., Q, varies through all 
possible subsets of the set .@(L). For each of these subsets obtain one ele- 
ment of E satisfying these conditions (if such an element does in fact exist); 
let a,, ..., a, be a list of these elements. Always choose α, to have 
Nx /a(%;) > 0 whenever possible and arrange the elements in such a way that 
Ne a(%;) > 0 for i= 1,..., μ' and Ny α(α!) «0 ifi = μ' + 1,..., u (of course, 
u’=u if m>0 with Ng α(ε) = —1 or if m < 0). Our discussion can be sum- 
marized in the following theorem. Recall that p,,..., p, denote those rational 
primes that divide disc(E/Q). 

Theorem 1.7 Let L be a Z-lattice with dim L > 3 and L, modular. Let 
J#1€ Go(L). ψ,0) = 1 € Y,(L) if and only if there exists « € E for which 
(a)h(j) ε JE and: | 

(i) If Lis definite, m> 0 and Nz o(é) = +1, « is a product of elements 
from the list 1, & py, ..., Pys Oy) νοις Oy 
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(ii) If Lis definite,m> 0 and Ny ale) = —1, « is a product of elements 
from the list 1, py, ..-, Dyes X15 -- +5 Ly: 

(iii) Ifm <0 or if Lis indefinite, « is a product of the elements 1, δ, p,,..., 
Digi Oys sing Oa 


Example 1.8 Consider the lattice L of Example 1.2. The map yf, 1s 
injective if and only if w,(j) is a nontrivial element of Y,(L). We consider 
several extensions E. 


(i) E=Q(./3). We have w,(j) = (2)W.(/) = ψε(. Furthermore, 
Ajo € ΕΖ for p = 190, since 19 is inert in E/Q. Thus, ,(k) = (1) and yw, is 
not injective. 

(ii) E = Q(,/5). Since 3 is inert in E/Q, we have A; ε E? for p = 30 
and w,(j) = (1); again w, is not injective. 

(iii) E = Ο(., 7). According to Theorem 1.7 we need only check the 
multiples of w,(j) by products of the elements 1, 2, 7, δ, and B where 
e=8+3,/7 and Nega(f)=2. First consider ¢. Since J/7= +1 in 

3 = Z/(3) and 171 = +8 in Q,, = Z/(19), ε is seen to be a nonsquare at 
each of the spots lying above 3 and 19. So (é)W,(j) # (1). We may choose 
p=3+ J 7. Local calculation shows that β is a square at one spot lying 
over 3 and a nonsquare at the other. Thus, (B)W,(j) # (1). Hence, w,(j) # (1) 
and w, is injective. 


2. Spinor Norms in Quadratic Fields 


The problem of computing the spinor norms of local integral rotations 
on a lattice K at the spots p of Q(./m) has been completely solved (see 
Kneser [11] and Earnest and Hsia [5]) unless p is a ramified dyadic spot; i.e., 
p|2 and e(p|2) = 2. In this section we calculate these spinor norm groups 
for the particular case of such localizations of a lattice L lifted from Q. Since 
2 ramifies in Q(/ m), m must be congruent to 2 or 3 modulo 4, and the 
results here vary significantly depending upon the specific congruence. The 
computations depend strongly upon the fact that the residue class field of 
Q(/ m) at p contains only two elements; in particular, any sum of two units 
of οί m), lies in the ideal p. 

In this section, we adapt our notation to this purely local situation. A 
dyadic prime spot on Q(,/m), m = 2 or 3 (mod 4), is denoted by p and F is 
the completion of οί m) at p. 1,11 be ἃ lattice over Z,, the integers of Q,, 
and so L is a lattice over o, the ring of integers of F. Since the spinor norm 
groups are unaffected by scaling, we shall assume sL = Z,.u, and u, are the 
groups of units of Z, and o, respectively. Finally, 2 denotes a fixed prime in 
F; i.e. πο = p. Then 2 = x7f for some β ε o. In the following proposition, 
which makes clear the local distinction of the cases m = 2 or 3 (mod 4), we 
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describe a choice of the element π and examine the resulting 8. The symbol 
QZ denotes the quadratic defect as described in O’Meara [12]. 


Proposition 2.1 2(2) = p><>m = 3 (mod 4). 

Proof If m=2 (mod 4), write m=2m', m' an odd integer. Let 
n= ./m/m’. Then x? = m/m? = 2m'/m? =2/m’. So, 2= 12m’; that is, 
β =m’ εὐ}, so 9(β) p*. Hence, 9(2) Ξ p° and it follows that (2) = 0. 

Conversely, if m = 3 (mod 4), then write m= 3+ 4t and let a=2 Ὁ 
2t — ./m. This a € 0 and a direct calculation shows N ειρε(α) Ε 112) and it 
follows that « € u,. Choose π = (./m -- 1),α. Then 2 = πἦα. Furthermore, 
a=1+(1+2t+ "/m) and Nrjg,(1 + 2t + ./m) is a prime element of 22: 
so 1+ 2t+ vm i is a prime element of o. In this case, D(a) = p and 2 = na 
gives Z(2)= p>. QED 


The next Proposiion reduces the problem of calculating spinor norms of 
all elements of O*(£) to that of performing the calculations only for the 
symmetries in O(L). 


Proposition 2.2 O(L) is generated by the symmetries S,, and Eichler trans- 
formations Εἰ, in O(L). 

Proof Apply induction on dim L. It suffices to consider those L with 
sL= 2Z,. For such a lattice. L=L, 1M with L, unimodular and 
sM © 2Z,. If L, 1s split by a hyperbolic plane the result follows as in 
O’Meara and Pollak [13, 14]. Since the orders of norm and weight generators 
of L, are both even, L, is split in that way whenever dim L, > 3. So we need 
only consider dim L, < 2. Furthermore, if L,; = A(2, 2) or L, unary, the 
result follows as in proof of Proposition 2.1 of Earnest and Hsia [7]. 

Suppose L, = A(e, 26), with e, 6 € uz, in basis {x, y} and σ € O (L). Then 
at least one of the symmetries S,,,,_, of ὅς, σω)- , lies in O(L) and the result 
follows by induction since ox splits L and there is a z with S, € O(L) for 
which δ, σ or S, Sya fixes x. If L, = A(1, 0) or A(1, 4) in basis {x, ¥}, then one 
Of Souy—-x OF Ss... ,6(x)-x lies in O(L) and the result follows similarly. This 
exhausts all possibilities. QED 


We need one further calculation in order to simplify the statements of 
later results. 


Lemma 2.3 Let d € u,. Then the F-space[1, d] represents all units h that 
lie in 1+ p?. 

Proof The quadratic defect Y(—d) is contained in ρ΄. If J(—d) « 40, 
then the result follows since the Hilbert symbol (h, —d), = +1 for any 
heu,. So we treat the case D(—d) = ρ΄. Write -d=1+7%e,ceu,.A 
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basis {x, y} for the space can be chosen for which the corresponding matrix is 


1 1 
1 —7ne} 


We shall show there is a vector v = x + Cy with C ε F for which Q(v) = 
By perfectness of the residue class field, h can be written ash = 1 + 2r,r € p. 
Now Q(v) = 1 + 26 — n3eC? = 1 + 2(C + ne’C’), εἰ € u,. The polynomial 
f(X) = ne'X? + X — ris reducible by the reducibility criterion of 13 : 9 [12]. 
That is, f(X) has a linear factor, and thus a root in F. Taking C equal to this 
root gives the desired vector. QED 


We now proceed with specific calculation of the spinor norm groups. 
These groups were computed in Hsia [9] for modular lattices with no restric- 
tions on the nature of the local field. From results obtained there, it can be 
seen that 0(O*(L£))=u,F? whenever dim L>2 and L modular but 
L# A(e, 26) where ε, 6 € u,. Proposition E of Hsia [9] applies to this excep- 
tional case, but does not yield a solution in closed form. However, in the 
present context the situation is seen to be considerably simpler. 


Proposition 2.4 Let L= Α(1, 2y), y € ug. 


(i) Ifm=2 (mod 4), then 0(O* (L)) = (1 + p?)F?. 

(ii) If m=3 (mod 4), then 0( ey = ᾿ F?, 

Proof We first show that (1 + p?)F? © θ(ο" ([)}: in any case. The lattice 
L can be rewritten over Z, as L=<1> 1 <d>, d=det Le u,. Let 
h=1+2t,t € o,. Then g<h, h~'d> = ho? + 20, = οὐ + 20, = 9(1, d> and 
the spaces [h, h~'d] and [1, d] are isometric over F by Lemma 2.3, so 
<h, h~'d> and <1, d> are isometric over ὁ by 93 : 16 of [12]. In particular, 
(1, d> represents ἢ over o. Since h € u,, the vector of L of length h gives rise 
to a symmetry of I. 

Next, suppose v = Ax + (γε P(L), where {x, y} is a basis adapted to 
A(1, 2y). Further, write 2y = 2?f noting that fe u, when m = 2 (mod 4) and 
Bf) =p when m = 3 (mod 4). If Ae u,, Q(v) is clearly an element of 
(1 + p*)u2. So assume A € p and, without loss of generality, C = 1. Thus, 
Q(v) = A* + xf. If ord, A = 1, then Q(v) € p*; but B(v, x) Ε u,.Sono such 
v can give rise to an ” integral symmetry. If ord, A > 2, it is clear that 
ord, Q(v) = ord,(x’f) = 2 and, thus, we get 9(0* ([)) « < u, F?. In the case 
m= 2 (mod 4), this containment can be strengthened because Q(v) = A* + 
nf = nf (1+ A?nx 7f~') and ord, A > 2 shows that 


1+ A’n”*4f-'e 1+ p?, 


but furthermore, [Ε 1 + p? sincefe up. So Ο(υ) ε e (1 + p*)u2 and we obtain 
the reverse containment, (1+ p?)F? > 0(O*(L)). However, when 
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= 3 (mod 4), Q(v) ¢ (1 + p*)F? for the vector v = 2x + γε P(£). So in 
shal case, 6(0* (L)) = u, F?.. QED 


Remark 2.5 In the present context, 1 + p? = u2(1 + ρ2) is a subgroup 
of u, of index two. 


Hence we see that the spinor norm group 6(0* (L)) has a very convenient 
form when L is modular. To compute the general nonmodular case, we will 
be able to determine the spinor norms using the components of a Jordan 
splitting and sublattices of the form <1) L <2’y>),r > land y € u,. We now 
turn our attention to these binary nonmodular lattices. 


Proposition 2.6 Let m=2(mod 4) and L=<1,2y>, yeu,. Then 
θ(0" ([)) = (1 + p?)F?. 

Proof Let {x, y} be a basis for <1, 2y>. Write 2y = x*f where, since 
m = 2 (mod 4), f can be chosen from u,. In particular, f= 1+ 29, g € υ2. 

To show the containment ¢, take v = Ax + (γε P(L), A, Ce o. If 
Aéu,, C € p it is clear that Q(v) € (1 + p?)F?. If both A εὐ, and C € u, 
assume A=1 and write Ο(υ)-- 1 - π202[Ξ 1 Ὁ nC? + 2n?C2g = = 
(1 + 2C)* + 2n*C7g — 2nC ε (1 + p?)F?. When A ε p, assume C = 1; then 
if ord, A > 2, Q(v) = nf (1 + Α2π᾿ 77) ε (1 + p?)F? since fe 1+ p The 
case C = 1, ord, A = 1 does not give a symmetry in O(L). This exhausts all 
possibilities. 

Conversely, if he 1+ p?, then <1, f> = <h, ΩΣ ΤΡ. by Lemma 2.3 and 
93 : 16 [12]. So there are s, t € ὁ for which h = 52 + εἶ If t € u, the vector 
v= nsx + ty of L has length xh. If t € p, the vector v = sx is tn~y has 
length h. QED 


Proposition 2.7 Let m=3 (mod 4) and L=<1,2y>, yeéu,. Then 
0(0* (L)) = {n ε F: (n, —f), = +1} where 2y = nf. 

Proof Consider the lattice Y = <1, f> over o in basis {x, w}. The lattice 
with basis {x, nw} i is isometric to L, so we may view Las a sublattice of #. In 
particular, 6(0* (L)) < 0(0* (#)). Conversely, suppose 


v= Ax + Cwe P(f#). 


IfAeu,, Ce p, say C= πζ, then u = Ax + Cy ε P(L) with QO(u) = O(v). If 
Ceéu,, assume C= 1 and consider the vector u= Amx+ye C. Then 

Q(u) = 72Q(v) and it remains only to verify that S, ε O(L). When A € p, 
Q(u) € 2u, and S, ε O(L). If A € u,, then we observe that Φ() = p since 
m=3(mod 4) and _ that Οὐ) ] = = |A*+f|=|a|. Furthermore, 
2B(u, L) = p> = Q(u)o and S, ε O(L) in this case also. Thus, we have shown 
that 6(0*(L)) = 6(0*()). But the group on the right can be determined by 
Proposition B of Hsia [9] and is seen to equal the set of elements satisfying 
the Hilbert symbol (η, —f), = +1. QED 
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Remark 2.8 In the case described by the above proposition, 0(0* (L)) 

has index two in F. This is the first instance in which we do not have 

O*(£)) < u, F?. It will be seen (Lemma 2.19) that this is the only way in 
which this containment can be violated. 


The calculation of θ(0 " (L)) when L = <1, 4y>, » € uz has been done in 
Proposition 3.1 of [7]. Here there is no distinction between m= 2 or 3 
(mod 4) and, combining this result with Lemma 2.3, we obtain: 


Proposition 2.9 Let L = <1, 4y), y € uz. Then θ(0" (£)) = (1 + p?)F?. 


Additionally, the cases L = <1, 2”y), Ε 112,7 > 4 are determined by 3.3 
and 3.4 of [7]. So the binary cases will be completely resolved with 
the calculation of 6(O*(L)) for L= <1, 8y>. Again in this case, we 
treat separately m= 2 (mod 4) and m=3 (mod 4). In either case, if 
{x, y} is a basis for the given splitting and v = Ax + Cy ε P(L), restrictions 
on the possible values of A and C are given by Proposition 3.2 of [7]. From 
these restrictions, it can be easily seen that 0(0*(L)) Ξ u, ΚΖ. 


Proposition 2.10 Let m=2(mod4) and L2<1,8y>. Then 
θ(0" ([)) = (1 + p?)F?. 

Proof It is easy to see that (1 + p*)F? < 0(0*(L£)) Ξ (1 + p?)F? by 
using Proposition 3.2 [7]. So it will suffice to show that for any τ = 1 + 278 
with ¢ € u,, there is v ε P(L) with Q(v) ε τ 2. We shall show there is such a 
» = Ax + Cy with Ce u, and A = 2a,a€ u, 

Since m = 2 (mod 4) we can write 8y = 4n2f where f € u,. In particular, 
—fel+p? and C?€1+p? gives —C?n?f = n7(1 + 2s) for some 5 € 0. 
Consider 


2,26 -- 2,.,20 -- 2 2πί(ε — 1) + xs) 

t— C’n?*f=14 2πε — C’n’*f=(1 +2)" |1 + (1+ πὴ 

Ife ¢ 1+ p’, then by taking C = 1 + a we see that s € τι, and the expression 
in square brackets 15 of the form 1 + 42, 7 € 0 aud is thus a square by the 
local square theorem; say (1 + π)(1 + 4πη) = = a7,aeu,. Then the vector 
v=2ax+(1+2)y has length Q(v) = 4a? + (1 ΕἸ n)4n2f = A(a? + (1 + 
π2) fn?) = 4(τ — (1 + x)?n2f + (1 + π)2π20) = 4c € tF?. On the other hand, 
ifee 1+ p?, then taking C = 1 gives se p and again the expression in square 
brackets is a square; say «? = (1 + π)}(1 + 4πη) again. This time v = 20x + y 
is the desired vector. QED 


Proposition 2.11 Let m=3 (mod 4) and L= (1, 8y>, ye uz. Then 
0(0+ (L)) = {n € u,: (η, —f), = + 1}F? where 8y = 4n°f. 
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Proof We again view Lasa sublattice of f = <1, f > {x, w}, y = 2aw. 
It suffices to show that 6(0*(L)) = (u, αὐ Q(P(L)))F?. Let u= Ax + 
Cy ε P(L). If Aeu,, then Q(u) ε F?. If Ce u, we may take C = 1 and 
restrictions on the value οἱ A are given by 3.2 of [7]. For 0 < ord, A <2, 
the vector v= x + 272A ἷν lies in P(#) and has length Α΄ 2 A(u). For 
ord, A>4, the vector v= (A/2x)x +w lies in P(Y) and has length 
(1/27)?Q(u). This shows 6(0* (E)) Ξ (u, ὦ Q(P(L)))F?. 

Conversely, let v= Ax+Cwe P(¥), Q(v)eu,. If Ce p%, write 
C = 2nCy, Co € 0,, and consider the vector u = Ax + Coy. a vector has 
length Q(») and lies in P(L). If 0 « οτά, C <2, then 2nC7! eu, and 
v = 2nC~'x + γε P(L) has length Q(u) = (2nC- *)?Q(v). So we are left with 
Ceu,; take C = 1. The vector u = 2nAx + ay has length Q(u) = (22)? x 
(A? + tf ) = (2 π)ο(υ). When Ae p, 2πΆ ε p* and so με P(L). Finally, if 
Aeu,, then |Q(v)| = |x| since Df) =p, and Q(v) does not lie in 

O(P(L)) 0 u,)F?. QED 

We turn now to the question of determining 6(0* (L)) for a general L by 
considering a Jordan splitting of L. The results vary markedly depending 
upon the congruence of m mod 4. We treat first the case m = 2 (mod 4). In 
this case, we have observed for binary L that 0(0*(L)) «Ξ τι, F?. This result 
can be extended to lattices of arbitrary dimension by an inductive argument 
and a consideration of all possibilities for the structure of the first Jordan 
component of a splitting of the given lattice. The result is stated here without 
proof. 


Lemma 2.12 If m= 2 (mod 4), then θ(0" (L)) « u, F?. 
This lemma has several immediate consequences. 
Theorem 2.13 Let m = 2 (mod 4). 


(i) If L has a Jordan component of dimension 3 or more, then 
0(0* (L)) = u, F?. 

(ii) If L has a binary Jordan component which is not of the form 
2’A(e, 25), re N ὦ {0}, ε, 6 € ug, then θ(0 (L)) = u, F?. 

So we may now assume that L has a Jordan splitting 
L=L, 1 27°L, L-+: 1 2"L, with dim L; <2 and all binary L,; have the 
form Α(ε, 26), ε, 6 € uz. 

We first treat the case when all L,; are unary. 


Theorem 2.14 Let m = 2 (mod 4), 
= LX, Β ϑ χὴν ΒΕ Ζ, χ, = {218} ΒΕ {280 at ΝΣ ΒΕ {2" ς;) 
be a Jordan splitting of L. Then 


even 


6(0* (L)) = \[] QO(v): v εἰ P(ox; 1 0x;41), 1<j<t— 1}. 
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Proof This result has already been obtained in 3.8 [7] unless there are 
indices k, 5, and t for which r,,, —1r, = 1 or 3 andr, — Γι = 2 or 4. For such 
lattices we have established that 


0(0+ (£)) > (1 + p?)F2 = 0(0* (ox, + 0x,+41))- 


So it suffices to show that 06(0*(£))=(1+ p?)F?. For any vector 
vy = δὲς A,x; € P(E), the condition m = 2 (mod 4) assures that there is 
exactly one index i = ig for which A?Q(x;) has maximal value. But since each 
A?Q(x;) has even order, it follows that 


Q(v) = Q(A;, ΧΡ) + 2, O(4:X)) Ε QO(Ai, Xio)(1 + p 2) 


Ξ- (1 - ρἢς ([ - p?)F?. QED 


We now treat the case when there is at least one binary component and 
all binary L; have the form Ale, 26), ε, 6 ει. By Proposition 2.4, θ(0 (0) 
contains the subgroup (1 + p \F 2 of index two in u, F?; thus, θ(ΟἿ (ἢ ) must 
equal either (1 + p ἽΡ᾽ or u,F?. If the components are not too closely 
bunched, only (1 + p?)F? is attained. 


Theorem 2.15 Assume that m = 2 (mod 4) and L has a Jordan splitting 
of the type described above. Then 6(O* (L)) = (1 + p?)F? unless Lis split by a 
sublattice 2" Καὶ of the form K = <e> 1 2A(n, 2μ) or Καὶ = A(y, 2n) L 2<e>. In 
this case, θ(Ο " (L)) = u, ΕΖ. 


Proof Suppose L contains a sublattice 2’K of one of the given types in 
basis {x, y, 2. If Καὶ = <e> 1 2A(n, 2u) the vector v= 2x+ay+ze P(K) 
has Q(v) ¢ fi + p?)F?. Since 0(0*(A(n, 2u))) = (1 + p”)F? has index two in 
u, F?, θ(0" ([2) must be all of u, F?. 

To show that in all other cases θ(Ο" (L)) = (1 + p?)F?, apply induction 
on the number of Jordan components making use of the following lemma: 


Lemma 2.16 Suppose L= K | M where Καὶ = A(n, 24) or K = (εν and 
M © 4Z,. If 0(0O*(M)) S (1 + p?)F?, then 0(0* (L)) Ξ (1 + p?)F?. 


Proof We shall write only the case Καὶ = A(n, | 2) since the argument for 
K =e) is contained therein. For any v € P(L), write v= Ax +Cy+z 
where {x, y} is a basis for K, A,C εὖ and ze M. If ord, A <1, then 
Q(z) ε Q(Ax + Cy)(1 + p?). When ord, A > 2,v € P(L) forces either C € αν 
or both A and C lie in ρ΄. In the latter case, if Q(z) 4u,, then 
patie hae + p”). We are left with the case Q(z) € 80 and 
|Q(v)| = |Q(z)|. If ord, Q(z) = ¢, the condition v € P(L) gives A, C € ρ' "2. 
δο 42, (3, "ἢ AC all lie inside ρ2 4 and, since t > 6, we have 2t — 4 > 
t+ 2. Thus, Ο(υ) € O(z)(1 + p?). This exhausts all possibilities and the 
lemma is proved. QED 
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We now complete the case m = 3 (mod 4). The Theorem 2.14 remains 
true also in this case. 


Theorem 2.17 Let m=3 (mod 4) and let L= Z,x, 1 τ... Z,x,= 
(218. 1. τ... (2"e,5 be a Jordan splitting of L. Then 


even 


0(0* (L)) = | [] O(v): υ € P(ox; 1 ox;4,), 1<j<t— 1}. 


Proof The result follows from Theorem 3.8 of Earnest and Hsia [7] 
unless there are indices k, 5, and t for which r,,,—r,=1 or 3 and 
r,—r, =2or4. Ifr,., τοῖν = 1 andr, — r, = 2, then 0(0*(L)) = F because 
6(O* (ox, L ox,)) = (1 + p?)F? ¢ 0(0* (ox, 1 ox,41)). This follows from 
the duality lemma of Ankeny and Hsia [1] which shows that for —f with 
BY(—f) = p there is an element μ᾿ with D(u) = p* for which (u, —f), = —1. 
SO we may assume there are no such indices. 

For a vector v = δὲ. A; X; € P(E), let k be the largest index for which 
ord, Q(A, Χι) is minimal. If there is another j < k with ord, Q(A, Χμ) = 
ord, Q(A;X,), it follows easily that j=k—-— 1, r;=r,—1, A, Εν.» and 
|A;| = |x|. Since r,—r, #2 for any 5, t in this case, we have r,—r, >4 
for h<k-—1 and +r,-r,>3 for h>k and it follows that 
Q(v) € O(A,_ , X,-1 + A, X;,)F?. So assume minimality occurs only at the 
index k. Then |A,|> [2] since δὶ ε Ο([ἢ. Suppose A, εὶς and that 
ἵκει —1, = 1. We treat here the case r,,, =r, +1. If j>k+ 1, then 
rj—1r, Σ 4 and Q(A,X,) € 8Q(v)o and similarly if j < k with r, —r,; > 4. If 
ry ="%-1 +3 and A,_, € 4u,, we have |Q(A,_, X,-1)| = |2| |Q(v)| anda 
calculation shows that 


Ο(Α,. 1 Xn—1 + Anat Xe) = Age 12™* (ay_ 8? + ἀκ ει) 


where δ € u,. The last factor, being a sum of units of defect < p>, lies in 20. 
That is, Q(A,_~,X,—1 + Ages X41) € 4A(v)o. On the other hand, if 
Ay+1 € p, then Q(A, 41 X,+1) € 4Q(v)o and Q(v) € Q(A,—1 Xy—1 + A Xx) x 
(1 + 40) and the result follows. Keeping the assumption r,,, — 7, = 1, the 
other possibilities |A,|= [π| and |A,|= |2]| are resolved in a similar 
manner. 

The above argument shows that when there is no index k with 
ry+1 — Γι = 1 we have Q(v) € u, F? for all v ε P(E). If this is the case and 
there are indices ἰ, t, and 5 with r,,, — r; = 3 andr, — r, = 2 the result again 
follows since 6(O* (L)) = u, F?. So to complete the proof we need only verify 
the result for a v ε P(L) whose value equals |Q(A,X,)| andr, — 1%, >3 
and r,4; —1, > 3. This can be easily done using an argument analogous to 
the one given above. QED 


When L does not have the diagonal decomposition discussed above, 
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θ(Ο" (L)) contains νι, ΕΖ since this is the spinor norm group for any modular 
component of dimension greater than one. The exact value is then calculated 
by the following theorems. 


Theorem 2.18 Let m= 3 (mod 4) and L have a Jordan component of 
dimension greater than one. Then 0(O* (L)) = u, F? if and only if Lis not ie 
by a sublattice 2" ζε, 2y), & y € uz. If Lis split in this way, then 0(O* (L)) = 


The second statement is clear since 6(O* (<e, 27)}} has index two in F. 
The first follows immediately from the following lemma, which is proved by 
induction on the number of Jordan components in a splitting of L. 


Lemma 2.19 If L is not split by a sublattice of the type 2" (ε, 2y>, then 
6(0*(L))  u, ΕΖ. 


3. Applications 


In Earnest and Hsia [6] a construction was given that produced a lattice 
L having arbitrary dimension d and g* (L) = 2” for any prescribed integer ἢ. 
Now, additionally, let k be any positive integer with k < n. We now give an 
alternative construction of a lattice L having both the properties above but 
also for which an infinite number of extensions E = οί. m) can be con- 
structed so that |Ker(y,)| = 2* in each of these extensions. 


Example 3.1 Let g,, ..., 4, be distinct odd primes for which the 
Legendre symbol (2/q;) = 1 for i= 1, ..., n. The lattice L= <1, p?, 
B?4-) has dimension d and g*(L) = 2", as can be easily calculated using 
the technique described in Section 1. In particular, the primes q,; may be 
chosen so that 4; = 1 (mod 8) for each i= 1,..., ἡ. 


Proposition 3.2 There are infinitely many extensions E = οι. m) for 
which |Ker(w,)| = 2" where the lattice L is constructed as above. 
Proof Let p be a prime that satisfies: 
(i) p=3 (mod 8); 
(ii) (p/q;) = —1fori=1,...,k; 
(iii) (p/q;)) = +1 fori=k+1,. 


We claim that when E = at/— p), ἐπ w,)| ΞΞ 25. First note that 
—peéA,,Q2 for i=1,..., k. So the idéles j(i) defined by 


ve _ A, if q -Ξ qi 
GD) = 1 otherwise 


all have the property that w,(j(i)))=(1) for i=1, ..., k; hence 
|Ker(,)| > 2". It remains only to show that no element j = | [7-441 (δ 
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δι € {0, 1}, becomes trivial in Y,(L). By Theorem 1.7, j € Ker(w,) if and only 
if h(j) € yJE where y is a product of the elements —1, 2, and p. Since 
q: = 1 (mod 8), —1 and 2 are squares in ΟΣ and the choice of p assures 
pe δ fori=k+1,..., n. Thus, \Ker(w,)] = 25. 

The p used above can ἢ δὲ chosen to be any prime that is a solution of the 
linear congruence system 


Y = 3 (mod 8) 
Y=t, (mod 4) 


Y =t, (mod q,) 


where t; € Z satisfy (t;/q;) = —1,i=1,...,k and (t;/q,) = +1 fori=k +1, 
ον πη. QED 


Remark 3.2 For the lattices L and extensions E constructed above, it 
can be shown that, although the map y, is not injective, the number g* (L) is 
at least as large as g*(L). The next example will show that the value of g* 
may decrease in passing from Q to E. 


Example 3.4 Consider the space V = [1, 1, —1, —1] over Q. Let K be 
the Z-lattice on V given by K = <—1, -- 52, 54, 5°) and let J, be the lattice 
on V, given by 


By 81: 14 of [12], there is a lattice L on V for which L,.~ Καὶ, for p odd and 
L, = J,. This lattice L has g*(L) = 2 but g*(L) = 1. 

By using the machinery established in Earnest and Hsia [6, 7], and 
Section 2, sufficient conditions in terms of the lattice L and the extension E, 
for the number σ΄ to be nondecreasing in passing from Ὁ to E = Q(./ m), 
can be obtained. Another group homomorphism y can be constructed, 
n: Ga(L) > G,(L), and shown to be injective under the given condition. 
Details of the construction can be found in Earnest [2]. Let W(E) denote the 
set of all rational primes that divide disc(E/Q). 


Theorem 3.5 Let L be a Z-lattice, dim L > 3. Then 
&(L) 0 S(L) = [5 > g* (L) = σ᾽ (L). 


Corollary 3.6 If no rational prime divides both disc(E/Q) and det L, then 
g*(L) 5 σ᾽ (ἢ) 
The computations in Section 2 can be used to generalize to quadratic exten- 
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sions the results of Kneser [11] and Earnest and Hsia [5] giving sufficient 
conditions, in terms of the reduced determinant of a Z-lattice L, for the 
number g*(L) to be one. We obtain here a somewhat weaker condition 
which guarantees only that g*(L) = 1 when L is lifted into a quadratic 
extension with odd ideal class number. 

Notation used here will conform to that introduced in Earnest and Hsia 
[5]. In particular, define d’L to be d’L = det(I’ *) where nL = aZ. If d(L) 
denotes reduced determinant defined in Eichler [8], d'L is related to 5(L) by 
the formula 5(L) = 2"(d’L)Z. Note in particular that, in contrast to 6(L), d’L 
is not always an integer but may be rational with 2-power denominator. So 
d'L =[], p” with 5,6 Ζ and s, positive for p odd (s, may be negative). 

Let E = Q(./m) and let the ideal class number of Ε be hg = h(E) " ho(E) 
where h,(E) is a power of 2 and h(E) is odd. We seek to determine the best 
possible bound & so that the conditions 


(i) s,< (n(n — 1))/2, n= dim L, 
(ii) 5) «99 


will imply that g* (L) < h,(E) (or that g* ([ < 2h,(E) when L definite, m > 0 
with Ngg(e) = -- 1). In particular, when hg is odd, we obtain g*(L) = 1 (or 
g*(L) < 2, respectively). The bound & varies depending upon the field E. Of 
course, if 2 is unramified in E, then the bound @ is exactly as determined in 
[5]. So we restrict our interest to the cases m = 2, 3 (mod 4). The proofs of 
the following results are consquences of the results of Section 2 and the index 
computations in Earnest [3]. 


Proposition 3.7 (i) If m= 2 (mod 4), the best possible value of B is 


p= = 3), za 


when m «0 or m > 0 with ΝΕ (6) = +1 (6 denotes the fundamental unit of 
(ii) If m=2 (mod 4), m>0 with Neale) = —1 and the lattice L is 
indefinite, then the best possible bound 935 is 2n(n — 1). 


[Examples showing the given bounds are best possible are 
L= 1 2"? (27‘yA(1, 2) in case (i) and L = ς---751 (1 Pr} <2“) in case 
(ii). ] 

Remark 3.8 In case m = 3 (mod 4), it is not as easy to determine the 
best possible value of &. If κι < 0, the result is as follows: 

(i) For m= —1, best possible value is B” = n(n — 1). 

(ii) For m+ —1, best possible value is B’ = n(n — 1) — [n/2} 


For m > 0, the best possible bound may be either Β΄, B”, 3B”, or 2B” and the 
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precise value can be determined by checking whether +2¢€N ra(E), 
whether the trace ΤΥ; α(6) = 0 (mod 4) and by evaluating the symbol (ε, f), 
where p|2 and 2 = nf in E,. ! 


Remark 3.9 Precise values of g*(L) are determined for unimodular lat- 
tices L in [3]. Also obtained as a consequence of the index computations 
there is a crude upper bound for the values of g*() for an arbitrary L. 
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On Products of Consecutive Integers 
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1. Introduction 


We define A(n, k) = (n + k)!/n! where n, k are positive integers and we 
wish to examine divisibility properties of A(n, k). 
First observe that the cases k = 1, 2 are special. The relations 


tA(n,1)= A(m,1) and  tA(n, 2) = A(m, 2) 


each have infinitely many solutions n, m—the first for every positive integer 
t; the second for every positive integer that is not a square, as can be seen 
from Pell’s equation 


(2m + 1)? — t(2n+ 1)? Ξ 1 -- ἰ. 


On the other hand, it is well known from the Thue-Siegel theorem that 
for given k > 3 and fixed t > 1 the equation | 


tA(n, k) = A(m, k) (1.1) 


has only a finite number of solutions in integers n, m. 
+ Research of the second author was supported in part by Grant MCS 76-06988. 
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It is possible that (1.1) has only a finite number of solutions for fixed t 
and variable k > 2, but we cannot prove this even in the case t = 2 without 
additional hypotheses on n. Perhaps the following stronger conjecture 
holds: A(n, k) = tA(m, 1) has only a finite number of solutions m > n + k for 
every rational t if min(k, J) > 1, max(k, 1) > 2. 

Let f (n, k) be the least positive integer m > n so that 


A(m, k) = 0 (mod A(n, k)). (1.2) 


We obviously have f(n, k) < A(n, k) — k and it is easy to see that for k > 1 
we get several residue classes (in addition to —k) (mod A(n, k)) for m which 
ensure that (1.2) holds. The number of these residue classes is always larger 
than k, in fact exponential in k, so that the above inequality on f(n, k) can 
always be improved. 

The algebraic identities 


x(x + 1)(x + 2) = (x? + 2x)(x + 1) 
which divides (x? + 2x)(x? + 2x + 1) 
x(x + 1)(x + 2)(x + 3) = (x? + 3x)(x? + 3x + 2) 
x(x + 1)(x + 2)(x + 3)(x + 4) = (x? + 4x)(x? + 4x + 3)(x + 2) 
which divides (x? + 4x)(x? + 4x + 3)(x? + 4x + 4) 


show that A(n, 3) divides A(n* + 4n + 1, 3); A(n, 4) divides A(n? + 5n + 
2,4); and A(n, 5) divides A(n* + 6n + 4, 5). Thus f(n, 3) <n? + 4n+ 1, 
f(n, 4) <n? + 5n+2, f(n,5)<n?+6n+4. It seems likely that these 
bounds are attained for infinitely many, perhaps for almost all, values of n. 
One might ask whether we get f(n, k) < n‘~® for all (almost all) large n and 
some 0 >0 when k > 5. In the other direction we would like to know 
whether f(n, k) > π᾿ ὅδ᾽ for infinitely many (almost all) n for some 6 > 0 
when k > 1. For k = 2, we have infinitely many n with f(n, k) ~ /2n. Are 
there infinitely many n with f(n, k) < cn for fixed c, k 22 

A function closely related to f(n, k) is g(n, k), the minimal integral value 
A(m, k)/A(n, Κ), m>n. The above discussion shows that g(n, k) ~ 
f(n, k)‘/A(n, k) for fixed k and thus g(n, k) <n* for k <5. Table 1 gives 
values of f(n, k), g(n, k) for small n and k. 

One may try to estimate the density d(n, k) of integers m for which (1.2) 
holds. Obviously d(n, 1) = 1/(n + 1). For k = 2 we have 


d(n, 2) = 204. 20/A(n, 2), (1.3) 


where w(x) denotes the number of distinct prime factors of x. Relation (1.3) 
follows from the fact that A(m, 2) is divisible by A(n, 2) if and only if we can 
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TABLE 1 


fo te “Ὁ, ΟΣ ἢ 5 2 3. 4 21 2 4 3 15 
2.532 7 6 3 2 6 14 4 6 3 3. 5 22 
3.ϑ 72 14 1 7 6 4 2 τ] 7. 6 13 646 
4 92 8 3 12 13 6 3. 13 68 22 1794 20 2691 
5 11 2 13 5 13 £410 52 2915 13 34 6 2 11 437 
6 13 2 47 42 25 39 52 1749. 17 57 16 99 7 2 
7 15 2 62 56 53 231 52 1113 31 476 50 18921 21 345 
8 17 2 34 14 42 86 32 [119 50 2703 50 10812 20 138 
9 19 2 43 18 19 7 51 477° 51 1908 20 46 59 68076 

10 21 2 31 8 63 160 62 720 51 1272 60 11346 49 11925 

11 23 2 38 10 25 9 24 15 60 1891 46 1645 62 33902 

12 25 2 7 33 62 96 61 372 «47 420 50 1749 50 5247 

13 27 2 19 2 47 35 31 22 31 42009 151 695981 169 11865205 

i429 2 79 27 117 413 268 72899 131 30954 284 20233213 149 3346915 

15 31 2 254 240 269 4065 302 92415 319 1860516 284 14452295 169 5393275 


factor A(n, 2) into two relatively prime divisors A(n, 2) = d, d2, (d,, 42) = 1, 
and require 


m=-—1(modd,), m= --2 (mod d,). 


Since there are 2°(4 2) such factorizations of A(n, 2), we get that many 
residue classes (mod A(n, 2)) for m. 

For k > 2, the problem of computing d(n, k) becomes messier, but not 
intrinsically difficult. The number of residue classes (mod A(n, k)) to which 
m must belong remains O(n‘) for every ¢ > 0 and hence 


I 1 
n* < d(n, k) < nk-& 


for all values of k. 
Another question is that of determining those m > nso that there exists 
some k for which (1.2) holds. Since for k > m — n, we have 


A(m,k) A(n +k, m—n) 
A(n,k) = A(n, m—n) ” 
which is certainly an integer for k = A(n, m — n) — m, the problem becomes 


trivial unless we restrict the values of k to k <m— ἡ. 
For n = 1 and any m we see that 


aaa ("FP 
A(l,p-1) p\ p-1 
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is divisible by p unless m = 0 (mod p). Since m cannot be divisible by all 
primes <m except when m = 2, we see that for every m > 2 there exists a k, 
1<k<m-—1 so that A(1, k) divides A(m, k). The question whether there 
exists ak, 1 < k < m— 2, so that A(2, k) divides A(m, k), which is equivalent 
to (22) divides (ἢ), seems much more difficult to decide. The general 
problem can be stated as follows: 

Given n> | is it true that for all (almost all) large m there exists a k, 


1<k<m-—vnso that 
k+n\\i{m+k 
9 
[ ' Ἷ , ): (1.4) 


If not, what is the density d*(n) of integers m for which (1.4) has a solution 
with 1 <k<m—n? 

In Section 2 we show that for bounded ratios m/n and any ὃ > 0 we get 
only a finite number of solutions of (1.2) with k > on. 

In Section 3 we treat the special case in which the set {n + 1,...,n +k} 
contains a prime and m/n is bounded to show that (1.2) has only a finite 
number of solutions 2 < k < m — nin that case. We give an example which 
may prove the only one with m < 2n. Finally, we mention some additional 
problems and conjectures. 


2. The Case k > on 
In this section we prove the following. 


Theorem 2.1 Given positive numbers ὃ, A so that k>6nandn+k< 
m < An, then there exists an ng = No(6, A) so that the congruence (1.2) has no 
solution with n> no. 


The proof depends on showing that for all large n there exists a prime p 
in the interval [n + 1, n+ k] that divides A(n, k) to a higher power than it 
divides A(m, k). 


Lemma 2.2. Assume that the hypotheses of Theorem 2.1 are satisfied and 
that every prime p € [n+ 1, n+ k] divides A(m, k). Then for every ¢ > 0 there 
exists an n, = n,(e) so that for all n > n, there exists an integer 1,2 <1< A, 
with 

(i—e)n+ (l—1)k<m<lk+en (2.1) 
and hence 
(I — 2e)n < k. (2.2) 


Note that it is possible to have every prime that divides A(n, k) also 
divide A(m, k). This always happens when k = In, m = I?n. 
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Proof Let nbe so large that there exists a prime in [n + 1, n + k] and let 
p be the largest prime in that interval. Let / be the largest integer so that 
lp <m-+k. Then | > 2 and for large n we have p > n + k — on. Hence 
m+k m+k m—n 


«- -«---  «16π:ῈΐῈῇ---------- 
Ρ Tek on Τα 4+5—6n 


Now pick n so large that p > n+ Καὶ — en/A. Then 


<1+A-1=A. 


m+k2>Ip>In+ lk—en 
and hence 
([ --- ε)η -Ὁ ([ -- 1k <m. 


Let 4 be the smallest prime so that m + k < (I + 1)q. For large n we must 
have n < q since otherwise | times every prime in [n + 1, n + k] would lie in 
[m + 1, m + k], which is impossible. Let n be so large that 


m+k mt+k en 


eS ae oo ee 
then m < lq and hence 
(1 - 1)m < I(m + k) + én 
or 


m < [lk + én. 


Lemma 2.2 Assume that the hypotheses of Theorem 2.1 are satisfied and 
that s is an integer such that n < k/s and every prime p € [k/t, (n + k)/t],t = 1, 
2, ..., 5, satisfies 

A(m, k) = 0 (mod p’). 
Assume further that for the integer | in Lemma 2.2 we have 


ok an <i(n-+k)<m+ 5k ten (2.3) 


where a/b and c/d are consecutive terms in the Farey series of order s and € is a 
given number, 0 «ε «5. 2. 
Then there exists an n, = n,(e) so that for all n > nz. we have 


m+ 


(ont (1- Sem (1 δ] + (2.4) 


and hence 
k > bd(l — 2e)n > s(1 — 2e)n. (2.5) 
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Proof For 5 = 1, this is Lemma 2.2. We now proceed by induction on 5. 
If max{b, d} « 5, then the result follows from the induction hypothesis. 

If d=s> 1, then ὃ « 5 and the first inequality in (2.4) follows directly 
from (2.3) while the second inequality holds by the induction hypothesis. 

Now assume that b = 5, d < 5. Then the first inequality in (2.4) is still an 
immediate consequence of (2.3). If the second inequality were false, we 
would have 


n+k 


fe Sat ae 
5 5 


If I(n + k) < m+ (a/s)k + en and n is sufficiently large, then there exists ἃ 
prime p so that 


— --- 


a a n+k_ en n+k 
m+ Kk — 2en<sip<m+—k + en and : τ] < 


Hence (s! — a)p < m and (sl+s—a)p>m+k and A(m, k) #0 (mod p’), 
contrary to hypothesis. 

We may therefore assume that I(n + k) > m + (a/s)k + en; then for large 
n there exists a prime p with 


a a én k sgn 
eh Pe 0 and A pe ae 


n+k 
l sl 


Thus, again, (s!—a)p<m while (sl+s—a)p>m+k and A(m,k)# 
0 (mod p*), contrary to hypothesis. 


Proof of Theorem 2.1 If 12 and every prime pe [k/2, (n+ k)/2] 
satisfies A(m, k) = 0 (mod p”), then, according to (2.5), we have k > 3n. Now 
let s be the largest integer for which sn < k. If every prime p ε [k/t, (n + k)/t], 
t = 1, 2,..., 5 satisfies A(m, k) = 0 (mod p‘), then, according to (2.5), we have 
k > s(2 — 2e)n > (5 + 1)n, a contradiction. 

Now if |= 2 and 2n < k, then Lemma 2.3 can be applied as before. If 
1 = 2 and 2n > k, then every prime p € [ἢ + 1, (n + k)/2] satisfies A(n, k) = 
0 (mod p?). But, according to Lemma 2.2, we have 


(4-—3esn<m<m+k<(6+e)n. 
Thus for large n there exists a prime p, 
(ὦ -- ε)ν «η « (ἢ Ὁ ε)η, 


so that for sufficiently small ε we have 3p<m while 5p>m+k and 
A(m, Κ) # 0 (mod p?). 
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3. The Case A(m, k) = 0 (mod A(n, k)); n+ k <m< An and 
{n+ 1,...,n +k} Contains a Prime 


We first mention the interesting example 
A(32, 6) = 37 - A(16, 6). (3.1) 


Here two of the integers 17, 18, 19, 20, 21, 22 are primes and 17 may well be 
the largest n which solves our problem in case A = 2. In the following we 
show that we can find an effective bound for all solutions k, n, m. 

From Theorem 2.1 we know that we can restrict attention to cases 
k < 6n where ὃ is any fixed positive number. Since there exists a prime p with 


n+1<p<n+k<m<An (3.2) 
and A(m, k)/A(n, k) is an integer, there must exist an integer / so that 
m+1<Ip<m+k. (3.3) 
Thus 
Int+l~k<m<lIn+ (! --- 1)k. (3.4) 


Lemma 3.1 Every integer 


xe[nt+ Ln] et Ἐπ 
has all prime divisors less than (I + 1)k. 

Proof Assume that x has a prime divisor q > (1+ 1)k. Now, either 
Ix<m+landlx+q>In+lk>m-+k so that q does not divide A(m, k), 
or Ix >m+k and Ix —q<In+ lk — (1+ 1)k < m and again q does not 
divide A(m, k). 

The set of integers in [n+ 1,n+k]\[(m+ 1)/l, (m+ ΚΥΠ contains an 
interval of length >k(I — 1712] = ks. 


Lemma 3.2 There exists a kg so that A(n, [ks]) has prime divisors greater 
than (1 + 1)k for all k > ko, k < On. 
Proof Set [ks] = t and consider the binomial coefficient 
n+t\  A(n, t) 
t Jt! 
Every prime power q* that divides a binormal coefficient ("7") satisfies 
4 <n-+t. Thus the hypothesis that all prime divisors are < (I + 1)k yields 


[ i ᾿ < (n+ {κατ 1) < (in + ἀγα + kilos k (3.5) 
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for a suitable constant c. On the other hand 


t 
ster) : 
t ΐ 
Now set (n + t)/t = C > 1/6 and compare (3.5) and (3.6) to get 
Cc! < colt 1)k/log K(sk eet 1)k/log k (3.7) 


which is false for k > ky provided ὃ is small enough. 


Theorem 3.2 For each A > 1 there exists only a finite number of integers 
k, n,m such thatk > 1,n +k <m< Anand A(m, k) = 0 (mod A(n, k)) where 
the interval [n + 1, n+ k] contains a prime. 

Proof We first pick ὃ in Theorem 2.1 sufficiently small and then can 
restrict attention to a fixed integer J, 2 <1< A+ ὃ. By Lemma 3.2 we have 
k<k y. Now pick one of the integers xe[n+1,n+k] so that 
Ix ¢ [m+ 1, m+ k]. Then, by the same argument that we used in the proof 
of Lemma 3.1 we have (x, y) < (1+ 1)k for every ye [m+ 1, m+ k] and 
hence, if x | A(m, k) we must have n < x < ((I + 1)Κ}Ἐ < (J+ 1)ko)*. 

We have not carried out the detailed estimates needed to show, for 
example, that the example stated at the beginning of this section is the 
unique solution for A = 2, except for A(4, 2) and A(8, 2), but it would not be 
difficult to do so. 


4, Open Questions 


4.1. In view of Lemma 2.2, it would be interesting to know the smallest 
m > 2k so that every prime in the interval [k + 1, 2k] divides A(m, k). In 
particular, is it true that m > Κ' for every c? 

4.2. We know of no example with n > 16, k > 2, where A(n, k) divides 
A(m, k) and n + k <_m < 2n. It would be interesting to find a bound for such 
n without the hypothesis that there exists a prime in the interval 
[nt+1,n+k]. 

4.3. A question related to those discussed in this paper is to find solu- 
tions for A(n, k)| A(n + k, n + 2k). Charles Grinstead has found the follow- 
ing examples: 


ΠΣ ee. 4 5 6 7 8 9 
k:| 4 3 206 1886 3472 3471 8170 8169 
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Introduction 


The Gauss sums for multiplicative characters of finite fields, which play 
such an important role in classical number theory, appear again in the 
functional equation of Abelian L-functions, as local Gauss sums 1,(x) for 
prime ideals p which are tame at y, 1.6., divide the conductor to at most the 
first power. Going over to Artin L-functions, one gets analogously tame 
local Galois Gauss sums for arbitrary characters of Galois groups, derived 
from the local constants of Langland. For details see, e.g., Martinet [4]. 

For tame Abelian local Gauss sums, the quotient 


]οίχ, &) = τι(χ)τ,(φ)τ, (φχ)"᾽. 
in the nontrivial case when p is actually ramified at y, ¢, and χῴ, is a Jacobi 


sum, hence an algebraic integer. We shall be concerned with the correspond- 
ing “non-Abelian Jacobi sum” 


]ο(χ, P) = Ty(x)°8* 2, (PFT, (xh) *, (0.1) 
deg(z) denoting the degree of χ, always under the hypothesis that p be tame 
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at y and @. From the connection between Galois Gauss sums and Galois 
module structure of algebraic integers (cf. Frohlich [1]) and using the theory 
of module resolvents (cf. Frohlich [2]), we shall derive an interpretation of 
the fractional ideals generated by the j,(y, @) which then implies that they 
are integral. This interpretation is new and of interest even in the classical 
absolutely Abelian case. Our method will also yield, without further effort, a 
similar interpretation for the quotients of local conductors, which, however, 
lies less deep. 


1. Statement of Results 


Let Q be the algebraic closure of Q in the field of complex numbers. A 
number field K is always a subfield of Q of finite degree over Q, and we write 
Gal(Q/K) = Q,. Throughout N/K is a normal extension of number fields 
with Galois group Γ and p is a finite prime divisor of K which, except for 
Theorem 1, will be assumed to be tame (i.e., at most tamely ramified) in N. If 
x, @ are characters of Γ, then j,(y, Φ) is defined as in (0.1), and k(x, ¢) is the 
field obtained by adjoining to the number field k the values of y and ¢ on T. 


Theorem 1 For we€ Qo, 
Ilx”, $°) = 7, (χ, Φ)". 
In particular j,(x, Φ) ε O(n, Φ) 


This theorem can be proved quickly. Observe that j,(z, w) is the local 
Galois Gauss sum 1,(9) for the virtual character 0 = deg()y + 
deg(v)¢ — χῴ, and that det, = 1 on I. Then apply the formula for Galois 
action in Martinet [4, II, Theorem 5.1]. 

Denote by R(I) the group ring of I over a ring R. Let E be a number 
field containing K, and let V, W be E(I-)-modules of finite dimension. F will 
also act on the algebra N @, E via the first factor. If f; Κ᾿ “:͵Ν @x E, 
g: W --οΝ ᾧ κξκ are homomorphisms of E(T)-modules so is the product fg 
given by | 

V @ WLES(N @x E) Θὲ (N @x ΕἸ τ N Qx E. 
We get a homomorphism 
μ: Hy ὃε Hw > Hyew (1.1) 
with μ( g) = fg, where we abbreviate 


H,= Homezn(V, N ®x E). 


Theorem 2 is an isomorphism. 
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Denote completion at p by subscript. Let 0, ©, 5 be the rings of algebraic 
integers of K, N, and E, respectively. Let M, N be lattices over 0 ,(I"), span- 
ning V, and W,, respectively. Abbreviate again 


Hy = Homs,n(M, (Ὁ @,0),). 


Thus Hy spans Hom, ἡ, (N @x E),) which we identify with Hy,,. 
We again get a homomorphism 


A,:Hy@Hy>Hmuen  (@= ®s,), (1.2) 


with A ,(f® g) = fg. By Theorem 2 this is injective and has finite cokernel. 
Moreover cok A, is an 0,-module, hence an 5-module. Denote its order over 
δ by I(M, N). This is an ideal of δ, where we now take E = K(y, Φ), with χ 
and @ the characters of the representations of Γ afforded by V and by W, 
respectively. Write moreover t(M, N) for the order of cok 4,, viewed as a 
module over the ring Ζίχ, Φ) of algebraic integers in Q(x, ¢). Then 


t(M, N) = Nxq, eau, 6) (Μ, N), (1.3) 


(norm). Also if @ € Qx and M® is the module obtained from M by semilinear 
action of w then 


{μ᾿ ΝΟ ἢ» = t(M, N). (1.4) 
Let now k= Καὶ ἡ Q(x, Φ)ὴ and let {co} be a right transversal of Q, in Qo. 
Theorem 3 


(i p(x. Φ)) = [1 time’, No’). 


Corollary j,(x, φ) is an algebraic integer. 


Next let ΜῈ denote the contragredient 0,(I)-module to M, ie. 
ΜῈ = Hom, (M, 6,). Let f,(z) be the local conductor. 


Theorem 4 
Fo(x)PF,(G) ΡΟ (ΧΦ) | = I(M, N)I(M*, N*). 


The corresponding corollary is known, even in the wild case (cf. Martinet 
[4, Exercise 2]). 


2. The Evaluation Pairing 


The notation is always that introduced in Section 1. We shall write 
ty: Hy Θεὲ VON @x E, ty(f, υ) =f(v) 
for the evaluation pairing. Let { f;}, {v;}, {g,}, {w,} be E-bases of Hy, V, Hy, 
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and W, respectively. We know that the E-dimensions of ἢν and of V coin- 
cide and that det(f,(v,)) is an invertible element of N @x E. Also 


det((f.91)(v; © ν")) = (det(fi(v,)))°* (det(g.(w,)))°*™, (2.1) 


where the row index on the left is (i, k), the column index is (j, ἢ). Thus the 
determinant in (2.1) is invertible and this implies that μ is injective. Counting 
dimensions we conclude that μ is bijective. This yields Theorem 2. 

The above applies equally if we complete at p. Let now {fj}, {v;}, ἰσκ)» {wis 
be 0 ,-bases of Hy, M, Hy, and N, respectively. Define 


s(M) = s*(M, (Ὁ @,0),) = 0, det(f ,(v,)) (2.2) 


as in Fréhlich [2, 83]. This indeed only depends on M, not on the choice 
of bases. By (2.1) 


5(M)*8s(N ie = 9, ἀερί((  σι)ίυ; Ὁ w))- (2.3) 


The multiplication of the 5,-modules on the left of (2.3) is induced by that in 
N @,E, see, e.g., Frohlich [2, A.2]. On the other hand, let {h,} be an 0,-basis 
of Hy @ nv (Ὁ = @,,). Then via (2.3) we get 


5(M)%°25(N)*8”) = s(M @ N)I(M, N),, (2.4) 


following the procedure to compute module indices in this situation given in 
Frohlich [2, A.1]. Indeed one can see this easily. If (aq, ,,,) is the matrix 
transforming the basis {h,} of Hy gw into the basis f, σι. then on the one hand 


I(M, N), =9, det(a¢, 4, r)> 
while on the other hand 
det((f:9x)(v; ® w;)) = det(ag, υ, ε) det(h,(v; Ὁ w,)). 


Now use (2.1). 

Let g be the multiplication N @x E> ΝΕ « Q, NE the composite field. 
Apply g, to (2.4) and observe that g acts as the identity on fractional ideals 
of 0,. We get 


9_(5(M))**g,(s(N))°*™ = g,(s(M @ N))I(M, N),. (2.5) 


3. Resolvents 


Let ao,(I') = ©,. Let T be a representation of I over Q, corresponding 
to the character y. In Q, = Q @x K, we define the resolvent by 


(a| x) = det(), a’T(y) *). 
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This is an element of Q*, in fact of (NE)*. But (cf. Frohlich [1, Theorem 217) 


g,(s(M)) = (a|x)o, 
Hence by (2.5) 


[(a|x)**8(a|p)***™(alxp) "Jo, = TM, N),.- (3.1) 


Indeed the expression in square brackets on the left lies in ΕἾ. View it as an 
idéle. It then defines a global fractional ideal of δ, “localized” at primes 
above. Similarly I(M, N) is.an ideal of 5 localized at these primes. From 
(3.1) we now get, in the obvious notation, the equation 


((a| x)**#(a| p)**™(a| xb) *) = UM, N) (3.2) 


for ideals of δ. 

The proof of Theorem 3 now follows from (3.2) and Frohlich [1, 
Theorem 4], using (1.3), and the observation that if {co} is a right transversal 
of Qe in Ὥρα, 4), {a} one of Q, in Qo, then {wo} is one of Ox in Qo. 

Similarly, the proof of Theorem 4 follows from (3.2) and Frohlich [1, 
Theorem 18]. 
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Let k be a finite abelian extension of the rationals Ο. We take “the Hasse norm theorem holds for k” 
to mean that each nonzero rational number is the norm of an element of k if and only if it is the norm 
of an element from each completion of k. This paper gives computable necessary and sufficient 
conditions for the Hasse norm theorem to hold for k in the case where k satisfies the following 
properties: (i) [k : Q] is a power of | where lis any prime, (ii) k is the composite of cyclic extensions 
of Q that have pairwise relatively prime conductors, and (iii) the conductor of k is odd. 


Introduction 


In 1931 H. Hasse [7] gave an example, i.e. k = Q((—3)'/7, 1312). that 
showed that the Hasse norm theorem did not hold for arbitrary abelian 
extensions. In 1936 A. Scholz [12] gave conditions for the Hasse norm 
theorem to hold for certain noncyclic extensions of degree /? over the ground 
field where | is a prime. In 1967 J. Tate and J.-P. Serre [1, p. 360] gave a 
method for producing numbers that were local norms everywhere but not 
global norms from Q(13'/?, 17*/7). In 1971 C. Pitti [11] and now O. Taussky- 
Todd [13] have given substitutes for the Hasse norm theorem for Klein 
4-group extensions. 
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This paper continues the project started in Garbanati [4, 5] of classifying 
those abelian extensions of the rationals for which the Hasse norm theorem 
holds. 


1. Preliminaries 


We now state those results of Garbanati [5] that we shall use but not 
prove and we introduce some new notation. 

Let k be a finite abelian extension of the rationals Q with Galois group 
G(k/Q). 

If F is a field, let F* be the multiplicative group F — {0}. 

Let p be a prime divisor (finite or infinite) of Q and let B € Q*. Let (β, k), 
be the Hasse norm residue symbol (cf. Garbanati [5] for a definition). Let N 
be the norm map from k to Q. By an abuse of language we shall take the 
phrase “the Hasse norm theorem holds for Κ᾽ to mean that “for any B € Q* 
there exists a B € k such that Nf = β if and only if (B, k), = 1 for all prime 
divisors p of Q.” 

Let K’ be the “narrow” genus field of k, 1.e., the largest abelian extension 
of k that is unramified at all (finite) prime ideals of k and which is abelian 
over Q. 

Let K be the “narrow” central class field of k, i.e., the largest abelian 
extension of k that is unramified at all (finite) prime ideals of k and which is a 
Galois extension of Q such that G(K/k) is a subset of the center of G(K/Q). 

Let σ΄ -- [Κ΄ : ΚΊ and z* =[K: k]. 

Theorem 1 Let k be an arbitrary finite abelian extension of Q. Then the 
Hasse norm theorem holds for k if and only if z* = σ΄. 

Proof [5, Theorem 1]. 

Theorem 2 Let k, and k be arbitrary finite abelian extensions of Q such 
that kj > k. If the Hasse norm theorem holds for kg, then it holds for k. 

Proof [5, Theorem 2]. 

Lemma 1 Let ky and k be arbitrary finite abelian extensions of Q such 
that ky > k. Let Ky be defined for ky as K was defined for k. Then Καὶ -: Kg. 

Proof [5]. 

Let k, = Κ΄ Let K, be defined for k, as K was defined for k. If G is a 
group, then let (G, G) be its commutator subgroup. 

Lemma 2 Let k be an arbitrary finite abelian extension of Q. Then 
G(K, /k,) = (G, G), where G = G(K, /Q). 

Proof [5, Lemma 14]. 

Let Q,, = Q(e?™") where m is a positive integer and let f be the conductor 
of k, 1.e., the smallest positive integer such that Κα Q,. 
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If G is a finite abelian group, let G’ denote the group of characters of G. If 
n is a positive integer, let G,, denote the multiplicative group of integers 
mod n. Let G=G(Q,/Q) and H = G(Q,/k). Then G(k/Q) = G/H. Let 
t=e?/S If ae Z (the integers) and a and f are relatively prime, let 
σα: ζ - ζ bea typical element of G. Let τ be the isomorphism of G onto Gy 
given via a, > a. Let H, = t(H). Let X = X, be the group of all the charac- 
ters of ΟἹ, that take on the value 1 at each element of H,. Let 


f=pil-- pe 
be the prime power decomposition of f. Since 
G, = Gy, Χο X Ga, 
where n, = ρα (1 <i - 5) we can identify the two groups above and write 
Gi, =Gi, Χ ΘΕ x Gi, 


thinking of the characters of G, as s-tuples of characters. For 1 <i<s 
consider the projection map 


pr;: ΟἹ > G,, . 


Let X,; = pr, X. Let X be the external direct product of the X; for 1 <i<s, 
1.€., 


i=1 
By [8, p. 357] X = Xx. 
Throughout the rest of this section we shall assume that: 


(I) [k:Q] is a power of | where / is any prime; 
(II) the conductor f of k is odd. 


Let Κο be the largest field such that k < ky c Q, and [ky : Q] is a power of 
I. Let ΧΟ = X,,. Since p; (1 <i - 5) is odd, X9 is cyclic. Let y? be a genera- 
tor of X°. (We shall require that y? be a certain type of generator of X? in 
Section 3.) Let o(-) denote “the order of.” Let «(i) = o(X?). Let 6; = εὐ πα, 
We define [p;, p,] (1 <i, j < 5) by the conditions 


0 «[»ι, Ρῃῇ 3 α(ἢἡ,  [p. pi] = (i), 
and 
42(p;) = PPA if FH: 


Let I(p;|p;) be the inertia group of p; where p; is some prime ideal in ky 
dividing p;. Let o;,, € I(p;|p,). (We are using o; instead of (p;) which was 
used in [5].) 
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Fic. 1 


Let Ky be defined for ky as Καὶ was defined for k. By Lemma 1 we have 
Fig. 1. 

By Garbanati [5, Lemma 11] there exists o;z, € I(;|p;) (the inertia 
group of p; where §; is a prime ideal in Κο) such that o;x, restricted to ko is 
Ciko- a 

Let σι be o;x, restricted to K,. 

Let X = X,, « X° = X,,.. Let B(i) = 0(X;). Let B(i, j) = min{B(i), B(j)}. 

If G is a group, let (g,, 92) be the commutator σ᾽ 'g.g, gz‘ and let cen G 
be the center of G. 

Suppose k, = Κ' is noncyclic. Then for appropriately chosen oj, we 
have the following lemmas. (We shall explain how to choose o,,, in Section 
3.) 


Lemma 3 Let G = G(K, /Q). Then G is generated by the o; (1 <i <s) 
and (G, G) « cen G. 
Proof [5, Lemma 13]. 


Lemma 4 Let W be the free group with (5) generators w,;(1 <i<j<s). 
Let N be the normal subgroup generated by the relators 


(νν;; » Woe) l<i<j<s, 1<b<c<s (1) 
wht D  L<i<j<s (2) 
[] wie ΠΠ wpe ol <iK<s. (3) 
i<i i<j 


(Note that i is a fixed number between 1 and s in the last type of relator.) 
Then : 


W/N = G(K,/k,) 
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via 
wii N =? (6; $ g;). 


Proof [5, Lemma 15]. 


2. A Criterion for the Hasse Norm Theorem to Hold 


Throughout the rest of this paper we shall define G via 
G = G(K,/Q) 
and H via 
H = G(K, /k). 


Let (G, H) be the normal subgroup of G generated by the elements by 
(g, h) where g € Gand he H. 


Lemma 5 Let k be an arbitrary finite abelian extension of Q. Then 
G(K,/K) = (G, H). 

Proof Let E=G(K,/K). Let ge G and he H. Then (g, ἢ) ε E, ie. 
gEhE=hEgE because H/E = G(K/k) < cen G(K/Q) =cen G/E. Thus 
(G, H) < E. Now to show E « (6, H). Let K be the field fixed by (6, H). 
Since (G, H) « (G, G), it follows from Lemma 2 that K > k, > k. Also K, is 
unramified over k,, which in turn is unramified over k. So K is unramified 
over k. Since g(G, H)h(G, H) = h(G, H)g(G, H), we get G(K/k)= 
H/(G, H) < cen G/(G, H) = cen G(K/Q). Therefore, KCK and_ so 
E <(G, H). 


Lemma 6 Let k be an arbitrary finite abelian extension of Q. The Hasse 
norm theorem holds for k if and only if (G, G) = (G, H). 


Proof By Theorem 1 the Hasse norm theorem holds if and only if 
K =k, which happens if and only if (G, H) = G(K,/K) = G(K,,/k,) = 
(G, G). 


3. The Structure of (G, G)/(G, H) for Fields with Properties (1){IV) 


In this section, for a specific type of extension of Q, we shall: (a) find a set 
of generators of H (Lemma 7), (b) find a set of generators for (G, H) (Lemma 
8), and (c) give a matrix method for determining the invariants of 
(G, GG, H). 

Throughout this section we shall assume that k is an abelian extension of 
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QO with the following properties: 


(I) [k:Q] is a power of | where ἰ is any prime. 
(II) The conductor f of k is odd. 
(III) Καὶ is noncyclic. 
(IV) k is the composite of cyclic fields with pairwise relatively prime 
conductors, 1.e., 


k=L, +L, 


where each L, (1 <d < ἢ is a cyclic extension of Q and if f, and fj, are the 
conductors of L, and L,,, respectively, and ὦ # d’, then f, and f,,are relatively 
prime. 

We shall now produce X = X,. Let Sy (1 <d < t) be defined via 


S, = {i|1 <i<sand p, | ἢ}. 


Without loss of generality we may assume that the p; (1 <i < 5) are ordered 
so that if 1<d<d' <t and ie δὲ and i’ € S,, then i<i’. Let wy be a 
generator of X,,. Then 


X, = X,, Χο X Xp, = Wa ΧΟ Χ Wa: 


Ifi e Sy, let χ; = pr; wz. As mentioned in Section 1, we now restrict 7, ie., x? 
must be a generator of X° having the property (x?) = y;. Let 
ν; = e?7™/BO. Then 


x(p))=viPrPh ak f Fi. 

Let k; be the largest subfield of Q,, such that [k;: Q] is a power of ἰ. In 
Section 1 we stated that Lemmas 3 and 4 held for appropriately chosen 
σικο € 1(p;|p;) where p; is a prime ideal in ky. To be more precise this means 
σικο generates I(p;|p;) and oj, satisfies the following relation for all j 
(1 <j <s) [5; 2, p. 238]: 

(0; . Kilo = Fike ”” (4) 
where (p;) = p;Z. (For this equality we are really thinking of o;,, as res- 
tricted to Κι.) We shall now produce such a σικο. Let a; be an integer (rela- 
tively prime to n;) such that y?(a;) = ὃ; and hence y,(a;) = ν;. Let a; be an 
integer such that 

a; = αἰ mod n; and a; = 1 mod f/n;. 
Then 
Xi(a;) = v; (5) 
Let ζ = e?"/S, Let o;,, be the automorphism on ky given by restricting ¢ — ¢*! 
to kg. Then o;,, generates I(p;|p;) and for i # j we have y?(al?* 4) = x?(p,). 
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Therefore if j # i, then 


k; -" 
(p; 2 k; wn = ΕΣ ΞΞ ot be Pil 
J 


where (+) is the Artin symbol. If j = i, then (4) holds trivially. 
Let c(d) (1 < d < t) be an integer in S, with the property that 
B(c(d)) > BG) ~— forall ie δι. 
If ie Sj, let 


u(i) = B(c(d))/B(i). 
(Note that i determines d.) If i € δ. let 
t= δ. *. 
Lemma 7 The automorphisms τὶ (1 <i -Ξ 5) and the elements of (G, G) 
generate H. 


Proof Sincek, > k, we get by Lemma 2 that (G, C) c H. We now show 
t; € H where ie S,. For this proof, let b; = at}. Then 1; restricted to k is 
ζ -- ζδιαι 1 restricted to k where a; ! is the multiplicative inverse of a; in the 
integers mod f. But by (5) we get 


Walbia; " = Lea ae )xi(ai)~ a = wy, 1 = 1, 
Thus τ, € H. 
Now we need only show that any automorphism in H can be written as a 


product composed of the τ; and elements of (G, 6). Let h € H. Since h ε G, it 
follows from Lemma 3 that 


h=hy hg 
where g € (G, G) and for 1 <d<t 


ha — []o; “. 
ie Sq 


Let all products which follow in this proof run over all ie S,. Since h e H 
we get h, fixes L,. Furthermore, h, restricted to L, is ζ > ¢" restricted to L, 
where m(d) = | | a; “. Thus 


1 = W4(m(d)"*) = I] χα, γε! = [] χεω(δι)" = χιω ( bj’). 
Therefore ζ > ¢" restricted to k, is 1 where n(d) = |] bf. Thus |] ot(* 
restricted to k, is 1, ie., [] ot e (6, G). So we have 


μιί (σ, G) = I] ohio, (Ὁ, G) = I] t5(G, G). 


Before proving the next lemma we note that if a, b, and c are elements of 
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some group E and (E, E) c cen E, then 


(a, bc) = (a, b)(a,c) and = (ab, c) = (a, c)(b, c) (6) 
(a, δ΄ 1) = (α, δ)" (7) 
(a, b) = (b, α)" ἷ. (8) 
Lemma 8 The following elements generate (G, H): 
(Coa » Coan KOE i> συ)" (9) 


whereieé S,and i’ eS, and1<d<d' <tandi< ii’. Note that ifd = d', then 


(9) reduces to (6;, ¢;,)~'. 


Proof We first show that an element of the type (9) is in (G, H). By (6), 
(7), and Lemma 7 
(Feta) » Feqay (σι Fir)” = (σι » Tr) € (G, H). (10) 
Similarly, 
(6; σι} (σ,,, σι) * ε (G, H) (11) 


Therefore by (8), (10), and (11) elements of type (9) are in (G, H). 

Now we show each element of (G, H) is a product of powers of elements 
of type (9). By (6) and Lemmas 3 and 7, (G, Η) is generated by (0;, τ} where 
1 <i, i’ < 5. Suppose i < 7’. Then 


(6; τι} = (6; 5 Fegan (σι, Fy)’ 
= [(σ : Tan (σ᾿ : Can)” πων 
x [(σξω ’ σι} μΟ (σ; . Oi)” Ἔ 


If i < c(d’), then each element in the brackets is either of type (9) or is 1. If 
i> c(d’), then d=d' and the first component above can be written as 
(σι). σι) κ΄) which is a power of an element of type (9). Suppose i > 7’. 
Applying (8) to the right-hand side of the above shows (σ;, 1;,) is a product of 
powers of elements of type (9). 


Lemma 9 Let k satisfy (I)-(IV). Let W be the free group with (3) genera- 


tors w,;,1<i<j<-s. Let R be the normal subgroup generated by the relators 


given in (1), (2), and (3) and by 
γενεά Wir (12) 
where i€S,, wt€S,, and 1l<d<d<t and i<i’. Then 
W/R =~ (G, G)/(G, HA). 
Proof This follows from Magnus et al. [9, p. 71] and Lemmas 4 and 8. 


We shall now produce a matrix procedure for computing the invariants 
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of (G, G)/(G, H). Let r = (§). Order (i, j), 1 <i <j 3 5, lexicographically, 1.6., 
(1, 2), (1, 3), ..., (1, 5), (2, 3), ---, (2, 5), ..., (5 — 1, 5) (13) 


Let this ordering determine the position of the entries in the following 
matrices. 

Let A be the r x r diagonal matrix A = diag[A(i, j)| where using the 
ordering of (13) B(i, j) is in the (i, j)th row and column. 

Let B be the s x r matrix where the ith row is obtained from (3) as 
follows. Let 1 <j <s,j #i. Ifj <i, put [p;, p,] in the (j, i)th column. ΠῚ <j, 
put —[p;, p;] in the (i, j)th column. Put zero elsewhere in the ith row. 

Let E be the r x r matrix where the (i, i’)th row is obtained from (12) as 
follows. Let i Ε S, and i’ € S,,. If d = α΄, then put 1 in the (i, i’)th column and 
put zeros elsewhere in the (i, i’)th row. Suppose d # d’. If (i, i’) = (c(d), c(a’)), 
put zeros everywhere in the (i, i’)th row. If (i, i’) # (c(d), c(d’)), put u(i)u(i’) in 
the (c(d), c(d’))th column and put —1 in the (i, i’)th column. Put zeros 
elsewhere in the (i, i’)th row. Let 0, be ther x (5 + r) zero matrix and 0, the 
s x (5 +r) zero matrix. Let m = 2r + 5. Let M be the m x m matrix 


A 0; 
M - Β 0, ‘ 
E 0, 


The Smith normal form of an m x m matrix M over Z is the unique 
matrix that results from applying elementary row and column operations to 
M so as to obtain an m x m diagonal matrix 


diag(z,,..., z,, 0,..., 0) 


where z;|z;,, for 1 <i<c— 1 and each Ζ; is a positive integer [10, p. 26]. 
The z,,..-, Z, are the invariant factors of M. 


Theorem 3 Let k satisfy (I)-(IV). Let M be the m x m matrix described 
above. Then there are precisely r invariant factors of M, namely, 2, ..., 2,. 
Suppose.z, Ξε ᾿Ξ Ζ,ΞΞ 1 and z,,, #1. Then 2,4, ..., Z, are the invariants 
of (G, G)/(G, H) ~ G(K/K’). 

Proof The same proof as was given in Garbanati [5, Theorem 11] works 
if we replace A/A‘R by (G, G)/(G, H). 

If b € Z, let δ᾽ be b mod 1. If C is a matrix over Z, let C be the matrix over 
F, (the Galois field with | elements) obtained by replacing each entry b of E 
with δ. 

Let C = [2]. 

Corollary 1 Let k satisfy (1)-(IV). Then (G, G)/(G, H) is the trivial group 
if and only if rank C =r. 

Proof Let U, and U, be unimodular matrices such that ὦ, MU, = 
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diag(z,, ..., z,) +03 where 0, is the (s+ r) x (s +r) zero matrix. Then 
(G, G)/(G, H) is triviale>e = r<>rank ὕ, MU, =re>rank C =r. 


4, The Hasse Norm Theorem for Fields with Properties (I){IV) 


In the following theorem we find a new matrix D over F, such that 
rank D = (5) if and only if rank C = r and then we use Lemma 6 and Corol- 
lary 1 to give a necessary and sufficient condition for the Hasse norm 
theorem to hold for k in terms of the rank of D. 

We now define ἢ. Let k satisfy (I)-(IV). In particular 


k = L, hae L, 
is the composite of t cyclic extensions of Q (namely, L, (1 < d < t)) which 
have pairwise relatively prime conductors. Let S = {p,, ..., p,} be those 


primes which ramify totally in some L, (1 < d < t). Note we are not fixing d 
here, 1.e., perhaps p, ramifies totally in L, and p, ramifies totally in L,, where 
d+#d'. Let w, generate X,,, 1.6.; 


X14 ἘΣ CWa> 


where X,, is the character group associated with L,. Let a, = [L,: Q]/I. Let 
w =e?" Let ἢ be the conductor of L,. Let 


Sa= {i| p; Ε S and p; | Sat- 
If p; ε S but j ¢ S,, define a,; (0 < ay; < ἢ) via 
a‘(pj) = 0%. 

Let u = (5). Order the u pairs (d, d’) where 1 < d < d’ < t lexicographically 
(cf. (13)) and use this ordering for the columns of the following 4 x u matrix 
D (over F;). The jth row of D is determined as follows. If j Ε δ, and 1<d< 
d’ <t, put —@,,, in the (d, d’)th column. Ifj Ε S,and 1<d<d' <t, put δι; 
in the (ὦ, d’)th column. Put zeros elsewhere in the jth row. 


Theorem 4 Let k satisfy (I}-(IV). Let D be the matrix over F, defined 
above. Then the Hasse norm theorem holds for k if and only if rank D = u. 


Proof We shall show that 
rank ἢ = ue>rank C=r (14) 


and then the theorem will follow from Lemma 6 and Corollary 1. We shall 
now prove (14) by performing a series of row and column operations on C 
and then simplifying and analyzing the matrix that results. In what follows 
let ERO (resp. ECO) stand for elementary row (resp. column) operation. 
Consider the E part of C. We use the notation employed to define E. For 
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the (i, i’)th row of E, there are two cases. Either d= d' ord # α΄. Ifd=d, 
then use the 1 in the (i, i’)th row and column of E to change (via ERO) all 
other entries in the (i, i’)th column of C to 0. If d#d' and (i, i’) # 
(c(d), c(d’)), change the i’th row of Bas follows. Multiply the (i, i’)th row of E 
by [p;, p;,] and add the resulting row to the i’th row of B. Also change the ith 
row of B as follows. Multiply the (i, i’)th row of E by —[p,, p;| and add to 
the ith row of B. At this point if (i, i’) # (c(d), c(d’)) and d - d’ the (i, i’)th 
column of C will contain —1 in the (i, i’)th row of E of C and zeros else- 
where. Use this -- 1 (via ECO) to change to 0 the (i)u(i’) in the (i, i’)th row 
and (c(d), c(d’))th column of E. Now change this —1 to +1 (via ERO). 
Change the (c(d), c(d’)) column (via ECO) into a (d, d’) column, i.e., move it 
to the (d, d’) column position. Since 1 < ἃ < d' < t, there are u pairs (d, d’). 
We have now transformed C into the direct sum of two matrices By and Eg, 
i.e, C= By +E, where By is an s x u matrix and Ep is an r x (r -- μ) 
matrix. But E, can be transformed (via EROs and ECOs) into ['"9" ] where 
I,_,, is the (r — u) x (r — u) identity matrix and 0 is the u x (r — u) zero 
matrix. We now have 


rank C =r<rank Bo = u. (15) 


What does the jth row of By look like? If j ε S,,, then in the (d, a’)th 
column (d < d’) we have 


Σ H@uO)LP: + Pil (16) 
If j ε Sg, then in the (d, d’)th column ᾿ < 4) we have 


If j ἐᾷ S, ὦ Sy, then we have 0 in the (d, 4 }1} column. 

Let j δ, where 1 <h <t,ie., p,| f,. Since the ramification index of p; in 
L, is o(x;) = B(/) [8, p. 358] and since [L, : Q] = 0(X_,) = B(c(h)), we get that 
p; tamifies totally in the cyclic field L, if and only if B(c(h)) = β(}) if and only 
if Ly μ()}). 

Consider By. If p; does not ramify totally in L, where j Ε S,, then by (16) 
and (17) the jth row of By is the zero vector. If p; ramifies totally in L, where 
jeS, (ie. u(j) = 1), and if j ¢ Sj, then 


Wal pj) = =[] Xi (p;) = = Il ybPi ΡΠ = =|] yee Pil 


where these products and the product that follows run over all ie S,. Let 
a, = B(c(d))/l. Then v%, = w where ὦ = e?™", Thus 


Walp) ἐς: [7 cyt lpi Pil, (18) 
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If j ¢ Sa> define Qqj (0 « 4] - ἢ via 


y"4(pj) = wo. (19) 
By (18) and (19) 
a4; = Σ, u(i)[p; , pj] mod 1. (20) 


By (20) Bo is in fact the s x u matrix given by the following prescription for 
the jth row. If p; does not ramify totally in L, where j € S,, then let the jth 
row be the zero vector. By (16), (17), and (20) we have the following. If p, 
ramifies totally in L,, where j € S,,, then in the (ὦ, d’)th column (d < d’) we 
have a,,. If p; ramifies totally in L, where j € S,, then in the (d, d’)th column 
we have —4@,,;. If p; ramifies totally in L, where j Ε S, but j ¢ Sy ὦ Sy, then 
we have 0 in the (d, d’)th column. Hence rank By = rank Dand we are done, 
1.e., (14) now follows from (15). 

Let T be the set of all p; (1 <j 3Ξ 5) that ramify totally in L, for some c 
(1 - ς - ἡ (ie. p; Ε S) and which, in addition, split in L, for all d # c. Let 
#(-) stand for “the number of elements in.” 


Theorem 5 Let k satisfy (1)-(IV). If g — #(T) < μι, then the Hasse norm 
theorem does not hold for k. 

Proof If p,€ T, then p; € S and so p; determines a row in Ὁ. Let g; be 
the number of primes in L, that divide p;. Then σ᾽ is the order of the set of all 
x € X,, such that p; does not divide the conductor of χ and simultaneously 
x(p;) = 1 [8, p. 356]. Therefore since ῥ᾽ splits in each L, where d # c, we have 
Wa'(p;) = 1 for all d + c. Therefore the row determined by p, is the zero row 
and so there are at most 4 — #(T) nonzero rows in ἢ. The result now 
follows from Theorem 4. 


The following theorem generalizes Garbanati [5, Theorem 10]. 


Theorem 6 Let k satisfy (I)-(IV). Let k be the composite of t cyclic 
extensions of Q, namely L, (1 <d < τ), which have pairwise relatively prime 
conductors. Let q be the number of primes that ramify totally in some L, 
(1 « ἀ - τ). If q< (3), the Hasse norm theorem does not hold. 

Proof This is immediate from Theorem 4. 


Example Let p,, ..., p; be distinct odd primes. Let 


k = Q((e,P, P2)'!?, (623), (63 Ρ4)᾽ 7, (8405). .7) 


where ¢;= +1 (1<i<4) and ¢,p,;p,=1mod4, ¢;p;4,; =1 mod 4 
(2 <i <4). Then k satisfies (I)-(IV). Since q = 5 and u = 6, the Hasse norm 
theorem does not hold for k. 


Corollary 2 Let k satisfy (I)-(IV). Let k be the composite of four or more 
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cyclic extensions L, of Q that have pairwise relatively prime conductors. If 
precisely one prime ramifies totally in each L,, then the Hasse norm theorem 
does not hold for k. 

Proof In this case 4 = t > 4, but then q < (4). 

The following theorem generalizes Garbanati [5, Theorem 8]. 


Theorem 7 Let k satisfy (I)-(II). Let k = L, - Lz be the composite of two 
cyclic extensions of Q (namely L, and 1,2) that have relatively prime conduc- 
tors. Then the Hasse norm theorem holds if and only if there exists a prime 
p € Q that ramifies totally in either L, or L, and at the same time does not split 
in k (i.e., only one prime in k divides p). 

Proof Let p,,..., pq be the primes that ramify totally in either L, or 1.2. 
We can assume p,,..., p, ramify totally in L, and p,4,,..-, Pg ramify totally 
in L,. Then D is the transpose of the matrix 


[-- ἄχ, — ἅ2χ "τ ἄχ Aint °° Aig). 

By Theorem 4 the Hasse norm theorem holds if and only if one of the entries 
is not zero, ie., either ¥4?(p;) # 1 for some j where 1 <j <n or W4'(p;) # 1 
for some j where n + 1 <j < qand where a, = [L,: ΟἹ" (1 <d < 2). Letg; 
be the number of primes in k that divide p,. As in the proof of Theorem 5 it 
follows from Leopoldt [8, p. 356] that if 1 <j <n, then 4; = 1 if and only if 
W3?(p;) # 1. Ifn+1<j <q, then σ᾽ = 1 if and only if y4{'(p;) # 1. The result 
now follows. 

Such a theorem produces numerous examples where the Hasse norm 
theorem holds in the noncyclic case. 


Example Let k = Q((mp,)'/?, p}/”) where p, and p2 are distinct primes 
and where m is an integer not divisible by p, or p,. Suppose mp, = 1 mod 4 
and p, = 1 mod 4. If (p, /p.) = —1, the Hasse norm theorem holds for k 
(where (+) is the Legendre symbol). This is because p, does not split in k if 
and only if (p, /p,) = —1. 


Corollary 3 Let k satisfy (I)-(IV). Using the notation of Theorem 6 let 
k = L,- L, and suppose [L;:Q] =! for i= 1,2. The Hasse norm theorem 
holds for k if and only if there is a prime dividing the conductor of k that does 
not split in k. 


For more information pertaining to this corollary, see A. Scholz [12]. 


Theorem 8 Let k satisfy (I)-(IV). By (IV) k = L, τ: L,. If for some pair 
{L;, L;} where 1 <i <j <t we have that each prime p that ramifies totally in 
either L, or L, splits in the composite L; 1... then the Hasse norm theorem does 
not hold for k. 


Proof This is immediate from Theorems 2 and 7. 
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Theorem 9 Let k satisfy (1)-(IV). Using the notation of (IV) suppose 


k=L,:L,°L,. Let gq = 3. If for some i where 1 <i < 3 we have for allj # i 
that p, splits in L;, then the Hasse norm theorem does not hold for k. 


Proof Since in this case 


det D = 3,4 ,2@23 — 2143243 


it follows (as in the proof of Theorem 5) from [8, p. 356] that under the 
hypotheses of the theorem det D = 0. 
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Scalar Extensions of Binary Lattices 


ROBERT GOLD PAUL PONOMAREV 


THE OHIO STATE UNIVERSITY 
COLUMBUS, OHIO 


For an algebraic number field E, let 0, denote the ring of integers of E 
and U, the units of ©,. Let K be an imaginary quadratic number field, 
K = Q(./—D), where D is a positive square-free integer. Let F be a totally 
real extension of Q and L = KF the compositum of K and F. In this paper 
we consider the following two types of scalar extensions of binary lattices in 
K. 

First, if 2 is a fractional ideal of K, we can form the fractional ideal WO, 
of L. 

Second, suppose is a lattice in a definite binary quadratic space V over 
Q having discriminant —D (modulo squares). Then V is similar to the 
quadratic space K with quadratic form n(x) = xx’, where J is the complex 
conjugation automorphism of K (cf. [1, §5.1]). Hence we may identify V with 
a quadratic space having underlying space Καὶ and quadratic form c - n for a 
suitable rational number c. We can form the tensor product 2 = 2 @, O,. 
Then & may be regarded as an (,-submodule of L, but it need not be a 
fractional ideal of L, even if 2 is a fractional ideal of K. 

For each type of scalar extension, we have an appropriate notion of 
equivalence. In the first case it is the usual equivalence ~ of fractional 
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ideals; in the second it is isometry between quadratic lattices. In this paper 
we present two theorems on the behavior of these equivalences with respect 
to the corresponding types of scalar extension. The first theorem generalizes 
a result of Kitaoka [3, Corollary to Theorem 2]. The second is the binary 
case of Theorem 2 in Kitaoka [3]. Our method is Galois-theoretic, and as 
such, differs completely from Kitaoka’s, which relies on a study of the 
minima of positive definite lattices upon totally real extension. 

Let the group οἵ roots of unity in L be denoted by W. If F is a Galois 
extension of Q, we shall identify Gal(L/K) with Gal(F/Q) and denote it by G. 
Since J commutes with all complex embeddings of F, e'~/ is a root of unity 
for any unit ς of L (cf. [2, p. 53]). Denote the subgroup of all roots of unity in 
L of this form by W. Let v = card(W), w = card(W). Since Y contains all 
squares of elements in W, we have either w = v or w = 2v. 


Lemma 1 Suppose F is Galois over Q. Let I be a fractional ideal of Κα 
such that UO, is principal. Then UW ~ P, °°: P,, where the P;,i=1,..., 5, 
are distinct primes of K that ramify over Q. 

Proof By assumption, 0, =a0,, ae L. For each oe G we have 
a°O, = (N0,)? = UNO, = «0,, from which it follows that αἱ 5 = ¢,e € U,. 
Then (α' 7)" “ -- εἰ “7, ec’ Ye Vv. Putting B =a’, 8 -Ξ 91", we see that 
(B!~4)'~¢ = 1, so that (β' 4)" = B~/ for allo ε G. It follows that β' 7 ε Καὶ 
and 8!-/ = f!-40@,. Thus 8 ~ 8’, which implies B ~ P, --- P,, where 
the %,, i= 1, ..., 5, are distinct primes of K ramified over Ὁ. 


Lemma 2 Let © be an order of K (not necessarily maximal). Then 
O ~ U, = Ug except if 0 = Ox when ἢ = 1, 3. 

Proof Wecan write Ὁ = Z- 1 + Z - fw, where fis a positive integer and 
w=,./—D or w=(1+./—D)/2, according as -- ἢ Ξ 2,3 (mod 4) or 
—D=1 (mod 4), respectively. Suppose first that w=./—D and 
eeO@ou,. Write e=atbf./—D, abe O,. Assume e¢F. Then 
ε — εἰ = 2bf./—D, so that (1 — e4~*)O, = (ε — €7)O, = (2bf\/ — D)O,. Now 


eJ-! = ζ, a root of unity, say a primitive mth root of unity. Then 


(1 — €)0, = (2bfy —D)O,. (1) 


If m is not a prime power, then 1 — ¢ is a unit [4, p. 267], contradicting the 
fact that 2 divides the right-hand side of (1). Hence m = ρ', pa prime. Since 2 
divides the right-hand side of (1), p = 2. But then 


(1 ae ὥριο, = 20, ; 


where ¢ is the Euler function. Hence g(2‘) = 1, t = 1. Then δ € U;, f= 1, 
D = 1, which implies ζ = —1,¢ = b./—1, O = Ox. 
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Suppose now that w=(1+./—D)/2 and e€ On UL, €€ Uz. If 


e = a+ bfw, then 
ύώ, Ξ Ων —D)O, ’ (2) 


where again ( = ¢!~', a primitive mth root of unity. Since D = 3 (mod 4), 
the right-hand side of (2) # ©,, so m= p’, p a prime, and 


(1 — £)"'O, = pO. 


It follows that p is the only rational prime which can divide D. Hence D = p, 
b € Up, f = 1. Furthermore, (p') = (p — 1)p*~* = 2, which implies p = 3, 
t = 1. Thus ζ = (—1 + ./—3)/2, ¢ = bl, and Ὁ = Ox, completing the proof. 


Theorem 1 Let F be a totally real Galois extension of Q such that v is 
relatively prime to [F : Q]. Let & be a nonprincipal ideal of K. Then WO, is 
principal if and only if there exist distinct primes ghee ..., B, of Καὶ ramified over 
Q, ¥? = p; Ox for rational primes p;,i=1,..., 8. ad. an element a € F, such 
that 


ἡ Pi: P,, (a7)O; = (p, °°" p.)Or - 


In this case, the class of U has order 2 and p,,..., p, ramify in both K and F. 
Proof We need to prove the “only if” part of the statement, as the other 
is trivial. By Lemma 1 we know Y’ ~ P, τ P, for distinct primes Pj, ..., 
R, of K ramified over Q. If WO, is principal, then WU" ~ Ox, where 
= [L: K] =[F : Q]. Since (m, v) = 1, we have WU ~ W" ὦ (P; τ P,)" for 
some integer n. By assumption, is nonprincipal, so n, v must both be odd, 
and we must have A~W ~ YP, - P,. We may assume that 
W= P, --- P,. Then 91" = (p, -“- p,)'P, - P,, where | = (v — 1)/2. From 
the proof of Lemma 1, we have 91" = BOx, where B'-’eK. But 
(915) 7 = Ox, so B'~/ € Ux. Since K has class number > 1, Ux = {+1}. 
Thus f= +f. If pe = B, take a=(p, °° | Ps) β. If Bp’ = —B, take 
a = c./—DB where c € Q is chosen so that αὖ is square-free. 

Remark The assumption on v in the statement of Theorem 1 is neces- 
sary. This can be seen by means of the following example. Let 
K = Q(./—23), a field of class number 3. Consider the cubic extension 
E= Q(a), where 4a* — 9a — 2 = Ο. Since the discriminant of the latter poly- 
nomial is 144-69, E is totally real and its normal closure F is given by 
F=E (./69). Note that L = KF tales a primitive cube root of ὉΠ Put 
a =a+b./—23, where b = (4a? — 3)7!. One easily checks that αὖ = 2 + 
./—23. Put %W-=the fractional ideal of K generated by 3 and 
(1 — ./—23)/2. Then 2° = (2 + ./ —23)O,x, so 91 represents an ideal class 
of order 3 and MO, = αὖ,.. i that in this case w = 6 = [F : Q], which 
implies (v, [F : Q]) > 1. 
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Corollary 1 (Kitaoka [3]) Let F be a totally real extension of Q with 
discriminant relatively prime to that of K. Then every nonprincipal ideal of K 
remains nonprincipal in L. 


Proof The discriminant of the normal closure of F must also be rela- 
tively prime to that of K. Hence we may assume that F is Galois over Q. If K 
has a nonprincipal ideal, then ἢ # 1, 3, which implies, by Lemma 2, that 
Ox c U, = Ug. However, since K, F have relatively prime discriminants, 
Ox = O,. Thus U, = U,, v = 1, and Theorem 1 is applicable. As there are 
no primes ramifying in both K and F, every nonprincipal ideal of K remains 
nonprincipal in L. 


Corollary 2 Let ζ, be a primitive pth root of unity, where p is a prime = 
3 (mod 4). Then every nonprincipal ideal of Q(./ — p) remains nonprincipal in 
the extension Q(C,). 


Proof In this case K = Q(./—p), F =the maximal real subfield of 
Q(¢,), L = Ο(ζ,). We observe that [F : ΟἹ] = (p — 1)/2 and w = 2p. Hence 
(v, [F : Q]) = 1 and Theorem 1 is applicable. The result follows from the fact 
that K has odd class number. 


Theorem 2 (Kitaoka [3]) Let 2, Mt be definite binary quadratic lattices 
over Z and F a totally real extension of Q. If 2 is an isometry of 2 @z Or, onto 
M @z O,, then X(L) = M. 


Proof The underlying quadratic number fields of 2, WM are determined 
by their discriminants. If &, MN are isometric, then ᾧ, M have the same 
discriminants. Hence we may assume that both are lattices in a single ima- 
ginary quadratic number field Καὶ = Q(./—D). Furthermore, after scaling 
suitably, we may assume that the quadratic form on & 15 the norm form ἢ of 
K, while the quadratic form on IM is c-n, c a positive rational number. 
Composing Σ with J, if necessary, we may take Σ to be proper. Then Σ is a 
proper similitude of norm c on the quadratic space (K, n). Any such proper 
similitude is multiplication by a nonzero element of L [1, p. 28]. Thus 
M = αὖ with n(x) = ax! = c. It suffices to show that « ε K. From M = ait 
follows that &, Mt have the same order 0. We may as well assume that 
© + ©, when ἢ = 1 or 3. Furthermore, we may suppose that F is Galois 
over Q. Imitating the proof of Lemma 1, we see that αἱ ὅε ὦ αὐ U, for all 
o € G. Thus αἰ 5 € U; for all σ ε G, by Lemma 2. This implies «’~/ € K, 
which, in turn, implies «? ε K. If « ε K, we are done. Ifa ¢ K, then K(a) isa 
quadratic extension of K and, as a consequence of the Galois structure of L 
over Q, K(x) is the compositum of K and a real quadratic field Q(./r) & F. 
Thus we may as well assume F = Q(./r), L = K(a). Let τ be the nontrivial 
element of G. Then αἷ = —a. Suppose 2 = ZA, + ZA,, M= Zu, + Ζμ2. 
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Then I = αὖ implies there exist Q1, 41, ἅς. a, € O, such that 
Ay My + anf, =4Ay, = 3 fy + Ag Uy = αλλ (3) 


with a, a, — a,a, € U,. Applying τ to the equations (3) and adding, we 
obtain a’ = —a,, i = 1, 2, 3, 4. Hence a, = b;./r for some b; € Z, i = 1, 2, 3, 
4. Then a,a,—a,a,=r(b,b,—6,b3), which implies r=1, a 
contradiction. 
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Any cubic extension of a number field is shown to be the resolvent cubic field of a generically 
irreducible quartic polynomial whose Galois group is the alternating group A,. This is applied to 
the question of which groups can occur as Galois groups in finite-dimensional division algebras. 


Introduction 


Suppose k < Lare algebraic number fields. Following the terminology of 
Schacher [2] we shall say L is k-adequate if L is a maximal subfield of a 
finite-dimensional division algebra with center k. A finite group G will be 
called k-admissible if G is the Galois group of a normal extension L of k with 
L k-adequate. 

In Schacher [2] it was asked whether the alternating group A, is Q- 
admissible, Q the field of rational numbers. We show in this note that in fact 
A, 18 k-admissible for all algebraic number fields k. This will follow from a 
general construction which realizes any cubic field as the resolvent cubic 


+ This work was supported under grant GP-33580, Basil Gordon, and GP-28696, Murray 
Schacher. 
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field of a covering quartic whose Galois group is A, and which satisfies 
prescribed local irreducibility conditions. We also study more closely the 
groups that are admissible over all number fields, with particular emphasis 
on the structure of their Sylow subgroups. 

For notational purposes, a division ring D finite-dimensional and central 
over a field k will be called a k-division ring. Throughout this paper we use 
freely the theory of valuations or primes for algebraic number fields, and the 
theory of Hasse invariants of k-division rings. If k is a number field and 4 ἃ 
prime of k, we write k, for the completion of k at 4. The order of a finite 
group G is denoted by |G]. 


1. Groups Admissible over All Number Fields 


It was proved in Schacher [2, 4.1] that all Q-admissible groups have 
metacyclic Sylow subgroups. The following theorem is of the same type, but 
with a stronger hypothesis and a stronger conclusion. 


Theorem 1.1 If a finite group G is k-admissible for all algebraic number 
fields k, then every Sylow subgroup of G is abelian and generated by two 
elements. 


Proof Suppose G satisfies the hypothesis of Theorem 1.1 and P is a 
Sylow subgroup of G, |P| = ρ΄, pa prime. We let k = Q(e,a) wheree,aisa 
primitive p*th root of unity. Since G is k-admissible, we have by [2, 2.6] G is 
the Galois group of a normal extension L of k, so that P is a subgroup of the 
local Galois groups Gal(L,, /k,,) and Gal(L,,/k,4,) for some pair of primes q, 
and q, of k. Let M; (i = 1, 2) be the field corresponding to P in the Galois 
correspondence, so that P = Gal(Lq;/M;). The rational prime p has a unique 
totally ramified extension in k; thus one of the q;, say q,, does not lie over p. 
It follows that the extension Lq, /M, is tamely ramified. Since ¢,, is in M,, 
we conclude by Albert [1, Theorem 4] that P is abelian. By [2, 4.1] P is either 
cyclic or the direct product of two cyclic groups. This completes the proof of 
Theorem 1.1. 


It follows from Theorem 1.1 that the symmetric group S, is not admis- 
sible over all number fields. This answers a question raised in Schacher [2]. 
By Schacher [2, 2.8] S, is admissible over all number fields, and this shows 
that such groups need not be abelian or nilpotent. Of course A, satisfies the 
conclusion of Theorem 1.1; we shall show presently that it is admissible over 
all number fields. A, also satisfies the conclusion of Theorem 1.1; we do not 
know if it is admissible over all number fields. By Theorem 1.1 A, is not 
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admissible over all number fields for n > 6 since its 2-Sylow subgroup is 
not abelian. We are not even aware of whether A, and A, are Q-admissible. 


2. Covering a Resolvent Cubic 


Suppose k is a number field and h(z) = z° + az + b € k[z] an irreducible 
cubic. Let 6,, 8,, 63 be the roots of h. We set 


f (x) = x* — (6y? + 2a)x? + 8(y? + ay + 6)x — (3y* + 6ay? + 12by — a?) 

(1) 
where y is a parameter that will be specialized in k. It is easily verified that 
the resolvent cubic of f(x) is 


g(w) = w® + (6y? + 2a)w? + 4(3y* + 6ay* + 12by — a?)w 
+ (8y® + 40ay* + 160by? — 4α2γ2 — 32aby — 8a? — 6403). 
The roots of g(w) are 
γν; = 2a — 2y? + 4y0;+ 40? (i =1, 2, 3). 
Thus Q(w,) = Q(0,), i = 1, 2, 3. The discriminants of fand g are both equal to 
2!2(y3 + ay + 6)?(—4a — 276?), (2) 


while the discriminant of the field Q(6;) divides — 4a? — 27b?. 

We specialize a bit further. Assume that a, b are algebraic integers and 
that χ᾽ + az + ὃ determines a cubic cyclic extension of k; thus its discrimin- 
ant -- 46 — 27b? is a square in k. This guarantees by (2) that the discrimin- 
ant of f(x) is a square in k, and so the Galois group of fis a subgroup of A,. 
In fact f(x) has Galois group A, if and only if it is irreducible. We aim for 
more; we shall show in fact that y can be chosen so that f(x) is irreducible in 
k,,{x] and k,.[x] for two primes 4; and q, of k. What follows is a description 
of how these primes are chosen. 

Let q be a prime of k at which z* + az + b is split completely; there are 
infinitely many such q available by the Tschebotarev density theorem. Then 
there is an algebraic integer yy € k so that yg + ayo + ὃ =0 (mod 4). If 
yo + ayo + b #0 (mod 42), we stop at this point. Otherwise let x be an 
algebraic integer in k that is a uniformizing parameter at 4. Then if 
γ)ι = Yo + 7, we have 


Yi tay, +b=ye+ ayo t+ bt 3n7yo + πίϑγὲ + a). 


Thus y; + ay, + ὃ #0 (mod 42) unless q|3y3 + a. But 322 + a is the deri- 
vative of h(z) = ζ΄ + az + b, and these polynomials have no common roots 
(mod 4) except for the finite set of primes 4 that divide the discriminant of ἢ. 
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Let then 4, and 4; be primes for which h(z) splits completely and which do 
not divide the discriminant of h. We also assume that q; } 2, i = 1, 2. By the 
Chinese remainder theorem and the discussion above we can choose an 
algebraic integer y of k so that 


y> + ay + b =0 (mod q;) 
but 
y>+ay+b#0 (mod gq?) i=1,2. 
We summarize this construction in: 


Theorem 2.1 Let k be an algebraic number field. Suppose the constants 
a, δ, γε k and primes 41, 42 are determined as above. Then for f (x) determined 
as in (1) we have: 


(a) f(x) is irreducible in k,,[x] and k,.|x]. 

(Ὁ) The Galois group of f over k is Ay. 

Proof If f(x) is irreducible in k,,[x], then it is irreducible in k[x], so the 
order of the Galois group of f is divisible by 4. Since the resolvent cubic field 
of fis the field of the irreducible cubic χ᾽ + az + b, the order of this group is 
divisible by 3. But the Galois group of f is a subgroup of A, since the 
discriminant of f is a square, and so must be all of A,. Thus (Ὁ) will follow 
from (a). 

Let 4 = 4, οἵ 42. Clearly f cannot be the product ofa linear factor and an 
irreducible cubic in k,[x] since the resolvent cubic of f splits in k,[x]. Thus iff 
is reducible over k,, it must be a product of two quadratic factors (not 
necessarily irreducible themselves) in k,[x]. In this event there are q-adic 
integers r, 5, and t with 


f(x) = (x? + rx + s)(x? — rx t+ t). (3) 
Identifying coefficients in (3) gives the equations 
s+tt—r? = —(6y? + 2a) 
r(t — 5) = 8(y> + ay + δ) 
st = —(3y* + 6ay” + 12by — a’). (4) 


It is easily verified that (3y? + a)? = —(3y* + 6ay? + 12by — a’) (mod 4), 
and so f(x) = (x? — (3y? + a)”)* (mod 4). Thus in (4) we have 


s=t=-—(3y?+a)(modq) and r =0 (mod 4). 


Then r(s — ἢ) = 0 (mod 42), but 8(y> + ay + δ) Ξ 0 (mod q’) by construc- 
tion. This contradiction establishes the irreducibility of f(x) in k,[x]. 


QUARTIC COVERINGS OF A CUBIC 101 


We have the: 


Corollary 2.2 A, is admissible over all algebraic number fields. 


Proof Let k be a number field and f(x) the quartic determined in 
Theorem 2.1. Let L be the splitting field of f(x) over k. We wish to show Lis 
k-adequate. By Schacher [2, 2.1] it is enough to show that the abelian group 
H?(A,, L*) has an element of order 12, where L* = L — {0}. Since the 
3-Sylow subgroup of A, is cyclic, H?(A,, L*) has an element of order 3 by 
Schacher [2, p. 455]. Thus it is enough to show that H?(A,, L*) has an 
element of order 4. Let D be the k-division ring with Hasse invariants 
determined by 


inv, (D)=%4 inv,(D)=-—4 inv,(D)=0, 4 3: 4, or 42. 
Since f(x) is irreducible at 4, and q,,4|[L,,: Καὶ (i = 1, 2) and so L splits Ὁ. 
This means that D determines an element of H?(A,, L*) of order 4, and L is 
k-adequate. But Gal(L/k) = A,, and so A, is k-admissible. 

We close with an explicit example over the rational field Q. The irredu- 
cible cubic Z*> — 3Z — 1 of discriminant 3* determines a cyclic cubic field 
over Q, so we choose a= —3, b= —1. If y= 3, then y>+ay+b=17. 
Then the polynomial f(x) becomes 


f (x) = x* — 48x? + 136x — 36 


with discriminant 2!734177. An examination of the Newton polygon shows 
that fis irreducible in Q,[x], and the proof of Theorem 2.1 shows f irredu- 
cible in Q, =[x]. Let L be the root field of f over Q. Then Gal(L/Q) = A, and 
L is Q-adequate. 

We note that all of our remarks apply for global fields of characteristic 
not 2 or 3, and we get: 


Corollary 2.3 A, is k-admissible for all global fields k of characteristic 
p #2 or 3. 
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A typical question in extremal number theory is one which asks how large a subset R may be 
selected from a given set of integers so that R possesses some desired property. For example, it is 
not difficult to see that if R is a subset of the integers [1,2,...,2N] and R has more than N elements 
then there are integers x and y in R so that x + y is also in R. The sets {1, 3, 5,...,2N — 1) or 
{N + 1, N + 2,..., 2N} show that this bound cannot be improved. 

In this note we prove several general results of this type. In particular, we show that if R < {1, 
2,..., N} and R has more than N — [N/n] elements, then for some integers x and y, the integers x, 
x+y, x +2y,...,x +(n— 1)y and y all belong to R. Furthermore the bound N — [N/n] is best 
possible. 
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1. Introduction 


Suppose Y = {L,(x1, ..., Xm) = 1 ayjxj: 1 <i <n} is a set of linear 
forms in the variables x, with integer coefficients α;;. The question we con- 
sider is the following: 


How large may a subset R of {1, 2,..., N} be so that for every choice of 
positive integers t;, 1 <j <m, at least one of the values L,(t,, ..., tn), 
1 <i<n,is notin R. 


Unfortunately, this question appears to be rather difficult and very few 
general results are currently available. In this paper we study this problem 
for several important special sets #. It will be seen that even in these simple 
cases, the problem is not without interest. 


2. Preliminaries 


Let [1, N] denote the set {1, 2,..., N}. If Z = {L(xy,..-, Xm): L< isn} 
is a set of linear forms, we say that a set R Ξ [1, N] is #-free if for any choice 
of positive integers t,,..., tm, at least one of the values L,(t,, ..., t,) does not 
belong to R. If R is not Y-free, we say that # hits R. Define 


S AN) = max |R| 
R 
where the max is taken over all R Ξ [1, N] that are #-free and | R| denotes 
the cardinality of R. Also, define 6(Y), called the critical density of #, by 
6(f£) = lim inf S AN)/N. 
N 

As an example, consider the system Y, = ἰχ + kx2:0<k <n}. The 
condition that R is £,-free means exactly that R contains no arithmetic 
progression of n terms. 

For this example, a recent result of Szemerédi [2], however, asserts that 


any infinite set of integers of positive upper density contains arbitrarily long 
arithmetic progressions. From this it follows at once that 6(#,) = 0. 


3. Augmented Arithmetic Progressions 


We now consider a system closely related to Y, which we denote by LF. 
It is defined by 
L*={x, +kxz:0<k <n} vu {xp}. 


In this case, #* hits R if and only if R contains an arithmetic progression of 
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n terms together with the common difference of the progression. However, 
the critical density of #* differs sharply from that of &, as the following 
examples indicate. 


Example 1 Let R, <[1, N] be defined by 
Κι = {x e[1, N]: x > [N/n}}. 
Clearly R, is #*-free since 
t, + (n— 1)tz >n(1+ [N/n])>N for t,,t, Ε Κι. 
Thus 
6(f*)>1-—n-'. (1) 
Example 2 Suppose n is prime and let R, <[1, N] be defined by 
R, = {x e[1, N]: x #0 (mod n)}. 


Then £* cannot hit R, since for any integers t, and t,, either 
t, =0 (mod n) or t; + ktz, O<k <n, runs through a complete residue 
system modulo n and therefore represents 0 ¢ R,. Note that 


|Ro| = Ν —[N/n] = [Ri I. (2) 


The following result shows that equality holds in (1) and, in fact, (2) is best 
possible. 


Theorem 1 Suppose R&[1, N] with |R| > N—[N/n]. Then #* hits 
R 


Proof Let R satisfy the hypothesis of the theorem and suppose R 15 
£*-free. Let A denote the least element of R. Then we may assume 


A <[N/n] (3) 


since otherwise |R| <N—[N/n]. Define the arithmetic progressions 
T, S [1, N] by 


T; = {i+ kA: « Καὶ <n}, L<i<N—(n-1)A. 
Also, define A;, A; ς [1, N] for 1 <j <n as follows: 
sian eee for 1<j<n, 
— \[(n-—1)A4+1,N] for j=n; 
[N-—jA+1,N—-—(j-—1)A] ἴον 1<j<n, 
[1,Ν -- (n—1)A] for j=n. 
By (3), we see that 


j 


A’ = 


J 


[Ax = [43 5 Δ. 
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Also, it is easily checked that if x € A; τ Aj then j + j’=n-+t for some ¢, 
1<t<n, and 


l{i: x € ΤῊ] =t. (4) 
We claim the following equation holds: 
N-(n—-1)A n-1 
niR[= Σ [BAR| + Yel oR] + 14) ORI) 6) 
i= j= 


To prove (5), let x ε R. Then for some k and k’, x € A, 7 Aj. Since the A, 
are disjoint, as are the A’,, then the contribution x makes to the second sum 
on the right-hand side of (4) is just (n—k)+(n—Kk’). Letk +k’ =n+t. 
Hence, by (4), x contributes exactly t to the first sum in (5). Therefore, each 
x € R contributes exactly 


(n—k)+(n—k’)+(k+k'-—n)=n 


to the right-hand side of (5) so that Eq. (5) is indeed valid. But by hypothesis, 
since Δ ε R, then |7; ὦ R| <n-—1 for all i. Thus, since [42 ἡ R| = 1, 
then by (5) 


n|R| <(n—1)(N —(n— 1A) + 28 Σ (" -η-«-- η(-- ἢ 
-- (πη -- ΠΝ -- A(—(n— 1)? +n(n— 1) -- (ἡ -- 1)) Ἐπ -- 1 
Ξ (n—1)(N + 1), (6) 
which implies 
ay < [Ἐπὰν ἢ]. wf ᾿ 
This proves Theorem 1. J 
Of course, it follows from (1) and : that 
S y,e(N) = N — [N/n] (8) 
and consequently 
δ(. 95) -- 1 -- π΄. 


4. Forms in One Variable—A Special Case 


As a prelude to a discussion in the next section of the general case of 
linear forms in one variable (1.6., with m = 1), we consider first the special 
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case F = {x, 2x, 3x}. This example in fact has all the essential features of the 
general case. 

To begin, we let D = {d, « d, < ---} denote the set of all integers of the 
form 273°, a, b > 0. 

Let N be a fixed positive integer. For 1 <t < N with (t, 6) = 1, let C(t) 
denote the set 


C(t) =[1, N] 7 {td,: k = 1, 2, ...}. 


Note that a set R  [1, N] is #-free if and only if R(t) = R 7 C(t) is #-free 
for all t with (t,6)= 1. For indeed, Y can hit R only if for some x, 
{x, 2x, 3x} > R. However, this implies that Y hits R(t) for some t relatively 
prime to 6. Thus, a maximal #-free set R is formed by taking the union of 
maximal £-free subsets from C(t) for each t, (t, 6) = 1. However, it is clear 
that 


X, = {td,: k= 1,...,r} S C(t) 
is #-free if and only if X, = {d,: Καὶ = 1,..., r} S| C(1) is #-free. Thus, if f(r) 
denotes the cardinality of the largest -free subset of {d,, ..., d,} and h(r) 


denotes the number of t € [1, N], (t, 6) = 1, with |C(t)| =7r, then for any 
L-free set R Ἑ [1, N], 


ΙΕ] < Σ fener) 9) 


For fixed r, |C(t)| =r if and only if 


td, < N < td,4, 


N/d,41<t < N/d,. 
Thus, 


wr) ome ὦ 


and, therefore, for maximal #-free sets Ry Ξ [1, N], 


_ |Ry| 12 1 1 

in τ πὰ Σ ΘΙ. τας) “- 
But 

fr+)-f(r) Ξ Ι, 
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so that letting K(#) denote the set {k: f(k) > f(k — 1)}, the telescoping sum 
in (11) becomes 
1 1 


3 k € K(f) d, 


5(L) = (12) 


Unfortunately, there does not seem to be any simple way to determine the 
elements of K(£#). The first few values are given in Table 1. 


TABLE 1 
κι ΓΤ κα (Ὁ Κ Fk) 
i 1 13 9 25 17 
y “22 14 10 26 18 
3 2 15 il 2 18 
4 3 16 11 28 19 
5 4 17 12. .2. 20 
6 5 18 13. 30 20 
7 5 19 13. 3: 21 
8 6 2 14 32 22 
9 7 2 14 33 22 
10 7 2 15 34 23 
11 8 23 16 35 24 
12 8 24 17 36 285 


Thus, 
K(#) = {1, 2, 4, 5, 6, 8, 9, 11, 13, 14, 15, 17, 18, 20, 


22, 23, 24, 26, 28, 29, 31, 32, 34, 35, 36, ...}. (13) 


It may be that f(k) = 1 + [2k/3] if k #0 (mod 3) and, perhaps, for all k, 
there is always a maximal ¥-free set 


Κι = (293%: i= 1,..., ΓΟ S (dy, ..., ἀρ 


in which all a; — b; are congruent modulo 3. 
It would also be interesting to know if 6() is irrational. 


5. Forms in One Variable—The General Case 


Let Y£ denote the set of linear forms {a,x, ..., a,x} where 
A = {a, <:::<a,}. Let P(A) = {q1, ..., q,} be the set of primes dividing the 
a; and let D‘” = (d, < d, <---) denote the set of all integers of the form 
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gi! --+ q*", a; >0. For each k let f(k) denote the cardinality of a maximal 
&-free subset of {d,, ..., d,}. Finally, let K(@) be defined by 


K(L) = {k: f (k) > f(k — 1). 


By using essentially the same arguments as in the previous section, the 
following theorem can be proved. 


Theorem 2 


τ [0-47 Σ αἱ (14) 


Ma K( #) 


6. Concluding Remarks 


One problem with a representation such as (14) is that it is not clear how 
to describe K(f) so as to be able to evaluate ),- x ἐκ '. Several systems 
L = Lay, ..., Ay) = (a, X, ..., a,x} οἵ forms in one variable are known, 
however, for which such a description can be given. We list a sample of these 
below. The arguments needed to determine the sets K() are not difficult 
and are omitted. 


1. d(L(1, p, p?, ..., p™ *)) = (p" — p)/(p" — 1) for p prime. Thus, 
6(L(1, 2)) = 4 as expected. 

2. (LA(1, n)) = nf(n + 1). 

3, 6(L(2, 3)) = 2. 

4. 5((l, 2, 8)) = 
here. 


81 Some recent results of Harlambis [1] are relevant 


It seems quite likely that almost all systems Y have 6(f) irrational 
although not even one such # is known at present! 
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...ma fait songer au théoréme empirique suivant: 


n étant un nombre entier, soit n, la somme des diviseurs de n, inferieurs a n, soit n, la somme 
des diviseurs de n,, inferieurs an, ; etc. Cela posé: les nombres n,n,,n,,...tendent vers une limite 
A, laquelle est 1 ou un nombre parfait. 


E. CATALAN [5] 


This is the origin of the Catalan—Dickson conjecture. Dickson [9] 
pointed out that the sequence will sometimes cycle between the members of 
an amicable pair, or round some larger cycle, though at the time no such 
cycle was known. 


La periode peut avoir plus de deux termes, qu’on pourrait appeler, pour garder la méme 
terminologie, des nombres sociables. Par exemple le nombre 12496 engendre une période de A[sic] 
termes, le nombre 14316 une période de 28 termes. 

Enfin dans certain cas, on arrive a des nombres trés grands qui rendent le calcul insupportable. 
Exemple: le nombre 138. 


P. PouLet [21] 


Poulet’s calculations led him to believe the contrary of the Catalan- 
Dickson conjecture, namely that there are sequences that increase 
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indefinitely, albeit somewhat erratically. Selfridge and the present writer, 
with considerable computational help from others [1, 2, 8, 11, 16, 17, 19, 20] 
have offered both experimental and heuristic evidence [13-15, 23] for the 
almost diametrically opposite conjecture that almost all even aliquot se- 
quences are unbounded. 

Let s(n) be the sum of the aliquot parts of n (divisors of n other than n 
itself so that s(n) = o(n) — n, where o(n) is the usual sum of divisors func- 
tion. Define s°(n) = n, and, for k 2 1, 


s*(n) = s(s*~ *(n)), 


(here often written n: k). Ὁ. H. Lehmer answered Poulet’s question about 
138 by showing that the sequence has a maximum 


179931 895322 = 138: 117 =2- 61 - 929 - 1587569, 


and that 138: 177 = 1. The next number to give difficulty is 276. Computa- 
tions were made by Paxson [19, 20], by Cohen [6], and, much more exten- 
sively, by Lehmer, and our present state of knowledge of the 276 sequence 15 


107100 047962 427456 048833 497403 019424 = 276 : 433 = 253 - 199 - ς, 


where c is a 31-digit composite number with no small factors. The desire to 
calculate such sequences has been one of a number of stimuli leading to the 
development of a variety of new algorithms for factoring large numbers [12]. 

Another question asked by Poulet concerned the existence of cycles of 
periods other than 1, 2, 5, and 28. In particular it is still not known if period 3 
can occur. Comparatively recently 14 cycles of period 4 have been found by 
Borho [4], Cohen [6], David [7] and Root [3], namely those for 


n= 1264460, 2115324, 2784580, 4938136, 7169104, 
18048976, 18656380, 28158165, 46722700, 81128632, 
174277820, 209524210, 330003580, 498215416. 


An interesting example of a periodic sequence is 17490, for which 
17490 : 228 = 1264460, the smallest member of the smallest of these 4-cycles. 

So aliquot sequences may be classified as terminating if a value of k is 
known such that n: k = 1, or periodic if numbers c 2 1 and k are known 
such that n:k=n:c+k, or incomplete otherwise. Since knowledge is 
steadily advancing, membership of these classes varies with time. A good 
deal of calculation has been done at levels greater than 107*: For example, 
the Lehmers have investigated the following sequences, in addition to that 
for 276: 


35149 477396 986268 016618 686344 127020 = 552: 181 = 27375 - 7%c, 
2 422499 075303 417661 059252 663526 = 564: 265 = 2 - 3723 - 89. ς, 
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166 036689 342670 728789 598390 618080 = 660 : 168 
S273 oo 6 11. 7 He, 
8 636062 816231 780695 927699 123810 = 840: 195 
= 2937571 536 
34 177685 625161 284824 431442 117328 = 966: 184 
= 277 139 τῷ: 
H. J. Godwin (written communication) has calculated 
8502 332502 428553 163137 122717 455339 104096 = 1074 : 849 
= 253-31. 457. ς, 
(assuming that ἃ 38-digit pseudoprime in term 847 is prime) and 
60012 099828 734883 991294 272626 672754 = 1464 : 530 = 2-3: 29: x, 


where x was still to be tested; and Selfridge and Wunderlich (see [14, 17]) 
have pursued all sequences starting below 10* to beyond 1022, However, 
only one sequence, 4488, has exceeded this level: 


1 807814 653481 873196 164108 = 4488 : 459 
= 2713 - 991 - 146819 - 3201581 - 74632871, 


and later been found to terminate, 4488 : 801 = 1. 

Devitt [8] has pursued all even numbers in the ranges (10* + 2, 10" + 
100) k = 9, 10, 11 and 12 (see also [16]) and found the following numbers 
of the three classes of sequences: 


k= 9 10 11 12 
terminating 161 146 139 113 
periodic 7 4 4 4 


incomplete at 1015 332 350 357 383. 


The steady increase of the numbers of incomplete sequences is, of course, 
partly accounted for by the fact that the later samples start somewhat nearer 
the arbitrary bound, 10'°, but this effect is only marginal, and if we calculate 
the average order of s(n) for n even (e.g., by the method given in Hardy and 
Wright [18, Theorem 324] we obtain 


n(52?/24 — 1) = 1.05617n > n, 


so that, on average, such sequences can be expected to increase. In [15] 
Selfridge and the present writer give detailed consideration to the various 
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guides and drivers (see below) which control aliquot sequences. Similar ideas 
are outlined by Wunderlich [23], and by Devitt [8, Chapter 4]. 

H. J. J. te Riele [22] has constructed an aliquot sequence containing 5092 
(and almost certainly many more) monotonically increasing terms. A most 
interesting remark in te Riele’s paper is the last sentence stating that H. W. 
Lenstra can prove the existence of arbitrarily long monotonically increasing 
aliquot sequences. Versions of Lenstra’s proof are given by Erdés [10] and 
Devitt [8, pp. 42-44]. Erdés also proves the stronger result: 


Theorem For all n except a sequence of density 0, and for every k and 
6>0, 


(1 — ayn < s'(n) < (1+ ayn 05} 


for 1 <i <k. 


Here we use Lenstra’s method to establish the following remarkable 
result: 


Theorem Given any prime p,, any integer t, and any real number p > 1, 
there are aliquot sequences containing t consecutive terms, each greater than p 
times the previous one, but whose only prime divisors exceed p,. 


Proof We construct such a sequence, 
n=n:0, n:1, n:2, ..., nit, 
where, for 0 <r St, 
Nir = Pui Pee2 “°° Pete’ ”* Petivt—rM 
and none of the first k + 1+ t — r primes divide m,. 


First choose / so that the sequence diverges with the required rapidity. For 
O<rs<t-—1l1, 


= 1 
n:fr n:fr n:f 
Pasi Pae2 °° Pasir, 
_o(PRss), σ(ρῖ3..), lm) __, 
Put Par2 m, 
(Dis 1 O(Di-2) σίρε) 1 
Pevi  Per2 Pest 
ς Ρκει a 1 Pr+2 4 1 Peri tl, 


9 


Pr+1 Pr+2 Pr+i 
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and since [] (1 + p~*) diverges, we may choose | (e.g., ἰ = [k?** ]) to make 
the ratio greater than p. 
Next choose a, to exclude the first k primes as factors of the n: r, e.g., 
= (p?p3--- p2) — 1. By the Euler-Fermat theorem, since (p;, P,.+1) = 1 
for 1 <i <k, 


pri, | = 1-mod p;, 


so pat} =1 mod p,;, and p,|o(pti1), pi|o(n:r), and since p,f{n:r, 
p¥s(n:r)=n:r+1 for0 <r <t— 1 and eachi, 1 <i <k. 

Finally, choose the remaining a;,,, 1 <j </+ t— 1, recursively, so as 
to ensure that 


pi; ||n:r for 1<j<l+t—r. 
E.g., take a;,, = σίρ 1) — 1, so that, as before, 
peri = = 1 mod per; , 
so pet; |o(ptiti+ 1), pet |o(n: τ), and since pf’, ;||n:r, we have 
pi. |s(nir)=nirt+1, O<r<et-l. J 
As can be seen from the form of the proof, the powers of the primes used 
are subject to eventual erosion. Dickson [9] looked for combinations of 
powers of small primes that would “lock in,” but any such search is doomed 


to failure; in fact Selfridge and the present author [15] give the following 
result. 


Theorem For any d, there is some prime divisor of d that does not divide 
a(d)/d. 


On the other hand, some combinations do have remarkably long “lives,” 
and these are important in making quantitative estimates. We restrict our 
attention to the factors of (2°) = 24 — 1. Define a guide g to be g = 2 υ 
where v divides o(2°). If also 25 * divides o(v), the guide is called a driver. If 
g = 2υ divides n and 2° (driver of) n. Drivers 
include the even perfect numbers; there are only finitely many others. 


Theorem [15] The only drivers are 2, 2°3, 2°3-5, 2°3-7, 273-11-31 
and the even perfect numbers. 


We may calculate the average order of s(n)/n for those n having a given 
guide, and this gives an estimate of the amplification of that guide, or ex- 
pected ratio of s(n)/n while the guide is in control of the sequence. Table 1 [8, 
p. 58] gives calculated amplifications for some of the more commonly occur- 
ring guides, and compares them with the results obtained from studying 
more than 186,000 terms, lying between 10° and 101°, of the 2000 sequences 
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with n even, 10 + 2 <n < 10" + 1000, 9 <k < 12. Less than 15000 terms 
had guides with powers of 2 greater than the fourth, or odd guides. 


TABLE 1 

Number of Calculated Observed 
Guide occurrences amplification amplification 
2 46526 n?/6 — 1 = 0.64493 0.62787 
243 24323 n?/4 — 1 = 1.46740 1.26744 
2. 32573 3π214 -- 1 = 1.11492 1.07573 
2:7 27607 n?/4 — 1 = 1.46740 1.40354 
2: 10014 n?/5 — 1 = 0.97392 0.97143 
2.3 6018 3π210 — 1 = 1.96088 1.69723 
255 5097 527/18 -- 1 = 1.74156 1.38542 
253 5 2917 n?/3 — 1 = 2.28987 2.17929 
2 11708 15π2|62 — 1 = 1.38781 1.34447 
2431 4919 n?/4 — 1 = 1.46740 1.44918 


The agreement between theory and practice is as good as one can expect. 
“Typical” behavior cannot be expected unless the number of prime factors is 
“large”; that is, when In In n is large, which is when n is well beyond com- 
puter range. The only even guides with amplification less than 1 are 2 and 
(but only just!) 2°. These two account for 56 540 of the terms examined, less 
than one-third of the total. 

One can also estimate, and observe, the break probability for a guide to 
change to a different guide, and the life of a guide, or expected number of 
terms in a sequence before the guide changes. 

For example, the break probabilities for the 2-driver and for the 2° guide, 
among terms of size n are approximately 


Zinn and (In In n)?/3 In ἢ, 
and the corresponding lives are approximately 
S(In n)/9 = and δ8344η n)/(In In n)?. 


With the previously calculated amplifications, we can estimate the size of a 
term after an average run under the control of a given guide. For these two 
guides we can expect a term of size n to become of the order 


n>! and ni — (In In n)2/14 
For the sample of 2000 sequences mentioned above, the observed lives were 


9.33 and 2.7, 
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The fact that these are less than the estimates is again due to the fact that the 
size of term that can be investigated by computer is far too small to exhibit 
typical behavior. 

The general behavior of aliquot sequences may be considered as a 
Markov process whose states are determined by the guides that are in 
control. The limit of the powers of the stochastic matrix associated with the 
same 2000 sequences yields the following probabilities of the seven most 
popular guides being in control: 


2 2.5 7: 227 2 2 2°31 
0.2256 0.1025 0.1535 0.1851 0.0482 0.0580 0.0481, 


accounting for 0.821 of the total probability. If we use these (and the other) 
probabilities to make a weighted average of the amplifications we obtain the 
figure 


1.05745, 


in very good agreement with the calculated amplification, mentioned before, 
the average order of s(n)/n taken over all even numbers, 


52/24 — 1 1.05617. 


It is highly unlikely that the Catalan-Dickson conjecture can be refuted, 
but the circumstantial evidence against it seems to have become 
overwhelming. 


REFERENCES 


[1] Jack Alanen, “Empirical Study of Aliquot Series.” Math. Rep. No. 133, Stichting Math. 
Centrum, Amsterdam, July, 1972; reviewed in Math. Comput. 28 (1974), 878-879. 
[2] Jack Alanen, “Tables of Aliquot Sequences,” seven vols. of computer output, lodged with 
R. K. Guy, Univ. of Calgary; reviewed in Math. Comput. 28 (1974), 879-880. 
[3] Μ. Beeler, R. W. Gosper, and R. Schroeppel, M.I-T. Artificial Intelligence Memo 239, 
72-02-29. 
[4] W. Borho, Uber die Fixpunkte der k-fach iterierten Teilersummenfunktion, Mitt. Math. 
Gesellsch. Hamburg 4 (1969), 35-38; MR 40, No. 7189. 
[5] Ε. Catalan, Bull. Soc. Math. France 16 (1887-88), 128-129. 
[6] H. Cohen, On amicable and sociable numbers, Math. Comput. 24 (1970), 423-429; MR 
42, No. 5887. 
[7] Richard David, letter to D. H. Lehmer, 72-02-25. 
[8] J. 5. Devitt, Aliquot sequences, M.Sc. thesis, Univ. of Calgary, May 1976. 
[9] L. E. Dickson, Theorems and tables on the sum of the divisors of a number, Quart. J. 
Math. 44 (1913), 264-296. 
[10] Paul Erdés, On asymptotic properties of aliquot sequences, Math. Comput. 30 (1976), 
641-645. 
[11] 1. Gillogly, Table of aliquot sequences 105 to 10° + 10° (unpublished). 
[12] Richard K. Guy, How to factor a number. Congressus Numerantium XVI, Proc. 5th 
Manitoba Conf. Numerical Math., Winnipeg, 1975, 49-89. 


118 


[13] 


[14] 
[15] 
[16] 
[17] 


[18] 
[19] 
[20] 


[21] 
[22] 


[23] 


RICHARD K. GUY 


Richard K. Guy and J. L. Selfridge, Interim report on aliquot series. Congressus Numer- 
antium V. Proc. Manitoba Conf. Numerical Math., Winnipeg, 1971, 557-580; MR 49, 
No. 194. 

Richard K. Guy and J. L. Selfridge, Combined report on aliquot sequences, Univ. of 
Calgary Math. Res. Paper 225, May 1974. 

Richard K. Guy and J. L. Selfridge, What drives an aliquot sequence?, Math. Comput. 29 
(1975), 101-107. 

Richard K. Guy and M. R. Williams, Aliquot sequences near 10!?. Congressus Numeran- 
tium XII. Proc. 4th Manitoba Conf. Numerical Math., Winnipeg, 1974, 387-406. 
Richard K. Guy, D. H. Lehmer, J. L. Selfridge, and M. C. Wunderlich, Second report on 
aliquot sequences. Congressus Numerantium IX. Proc. 3rd Manitoba Conf. Numerical 
Math., Winnipeg, 1973, 357-368; MR 50, No. 4455. 

G. H. Hardy and E. M. Wright, An introduction to the theory of numbers, 4th ed. Oxford 
Univ. Press, London and New York, 1968. 

G. Aaron Paxson, Aliquot sequences (preliminary report). Amer. Math. Monthly 63 
(1956), 614. 

G. Aaron Paxson, Table of aliquot sequences. Deposited in UMT file 1966, reviewed in 
Math. Comput. 26 (1972), 807-809. 

P. Poulet, Question 4865, L’Intermédiaire des math. 25 (1918), 100-101. 

H. J. J. te Riele, A note on the Catalan-Dickson conjecture, Math. Comput. 27 (1973), 
189-192; MR 48, No. 3869. 

M. C. Wunderlich, The Guy-Selfridge case against the Catalan-Dickson conjecture 
(mimeographed lecture notes). 


AMS (MOS) 1970 subject classification: 10A20. 


Integral Matrices 4 for which 4A‘ = ml 
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1. Introduction 


An arithmetical problem of some interest is the study of integral matrices 
A for which AA! is of some special form. The incidence matrix A of a 
symmetric block design with parameters υ, k, A is a (0, 1) matrix of order v 
satisfying AA’ = (k — A)I + AJ = B, J being the matrix of order v all of 
whose entries are 1. The case we are interested in here is that of an integral 
matrix A of order n such that 441 = ml. This includes all permutation 
matrices when m = 1, and the Hadamard matrices H = [h,,] of order n for 
which every ἢ; = +1 and HH" = nl,,. 

It is easy to find necessary and sufficient conditions that AA’ = ml, have 
an integral solution A of order n. These are: (1) if n is odd, m is a square; (2) if 
n = 2 (mod 4), m = a? + b’; (3) ifn = 0 (mod 4), m is any positive integer. 
These conditions are established in Section 2, and are necessary for the 
existence of a rational A and sufficient for the existence of an integral A. 

If X is the r by n integral matrix consisting of the first r rows of an A for 


+ This research was supported in part by NSF Grant MPS 72-05035 A02. 
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which AA’ = ml, then necessarily XX = ml,. The difficult and challenging 
problem is that in which we are given an r by n matrix X with XX‘ = ml, to 
find whether or not there is an A with AA‘ = mI that has Χ as its first rows. 
This we call the completion problem. In Section 3 using results of [2], we can 
show that there is always a rational completion (m and n satisfying the 
preceding conditions), that is an A with X as its first rows and its remaining 
rows rational. 

In Section 4 it is shown that if there are only one or two rows to be added 
to X, then there is an integral completion A. In Section 5 the case of Hada- 
mard matrices is examined more carefully and it is shown that if at most four 
rows need to be added to an X there is a Hadamard completion. But there 
are Hadamard X’s that do not have Hadamard completions, though they 
may have integral completions, as an example shows. 


2. Existence Conditions 


Following the methods of Chowla-Ryser [1] we can easily determine 
exactly conditions for the existence of an integral square matrix A of order n 
such that A’A = ml. 


Theorem 2.1 There exists an integral square matrix A of order n such 
that ΑἿΑ = ml, ma positive integer if and only if: 
(i) for n odd m is a square, m = mj; 
(ii) for n= 2 (mod 4) m is a sum of two integral squares, m = a* + b?; 
(iii) for n=0 (mod 4) m is any positive integer. 
Proof Let A = [α,}], i, j = 1,..., n. Let x,,..., x, be indeterminates. Let 
us write 


L, = ¥ ayjx;, {ΞΕ λυ Ἦ: (2.1) 
j 


Then the matrix equation 
A'A=ml (2.2) 
is equivalent to the identity 
Li +°7°+ 15 = αὶ ἘΠ::- x2), (2.3) 
as is well known from the theory of quadratic forms, but a direct check is 
easy in this case. 
Taking determinants of both sides of (2.2) we have 
(det A)? = m" (2.4) 


from which it readily follows that if n is odd, then m must be a square 
m = mj, then taking A = my, I, Eq. (2.2) is satisfied. 
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From here on let us suppose that n is even. Then from the theorem of 
Lagrange} any positive integer m can be written as a sum of four integral 
squares 


m = bi + b3 + b3 + bi (2.5) 
and we have identically 


(bi + b3 ἘΝ b3 ἘΣ b4)(x? ae Sar a ΧΙ ΒΝ ΧΙ 5) = ye zs ὙΠ τῇ ἡ ἘΝ γὴε3 
(2.6) 


where 
yy = bX; — bz Xi41 — b3X;42 — Da Xi+3 
Vier = 52x; + By Xi41 — D4Xi42 + O3Xi43 
Yit2 = b3x; + Dg xXi41 + D1 Xi42 — b2Xi43 
Vi+3 = b4x; — b3X;41 + b2Xj42 + Oi X43. (2.7) 
We note that if we define a matrix B by 


b, —b, —b; —b,y 
δ; δι —b, ὃ. 
. bs  b, —b, |? 
δ, —b; b, δι 


B= (2.8) 
then by direct calculation 
B'B = BB'(bj + b3 + b3 + b4)I. (2.9) 


From this it follows that det B= +(bj + b3 + b3 + b3)? and since δὲ has 
coefficient + 1 in det B that det B = (bj + b3 + b3 + bi)*. Hence from (2.7) 
we can express the x’s in terms of the y’s rationally with denominator μι, 
where m = δὲ + b3 + ΡΞ + Ρ2. 

We also note that if n = 0 (mod 4) and if m = b? + b3 + b? + b2, then 
we can take a matrix A as the direct sum of n/4 copies of B: 


aE 0 
A= 018) (2.10) 


0 
and from (2.9) it follows that 
ATA = AAT = ml. (2.11) 


This covers part (11) of the theorem. 


+ The arithmetical results used in this section may be found in Hardy and Wright [3]. 
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It remains to consider the cases when n = 2 (mod 4). We can apply (2.6) 
to the x’s in (2.3) four at a time to obtain the identity 
Lites + LR = yi te + yaa + m(x,~1 + x2). (2.12) 


Here y,,..., ¥,—2 and x,,_ 1» x, are independent indeterminates, and Ly, ..., 
L,, are rational linear forms in these indeterminates. We can determine y, as 
a rational linear combination of y,, ..., ¥,-2, Χη-- 1. X, by putting 


L,=+y; (2.13) 
where we take L, = y, if the coefficient of y, in L; is not +1, and putting 
L, = —y, if it is. Using (2.13) to express y, in terms of y2,..., )},.-- 2. Χῃ-. 1» Xn 
we have from (2.12) 

L34+--°+ L2 =y3 t+ + ya-2 + πιῇ. + x’), (2.14) 


an identity with L,,..., L, rational linear forms in independent indetermin- 
ateS y>, .--5 Va—-20 Χη--1» X,- Similarly, we put L, = +y, to express y, as a 


rational linear form in y3, ..., Y,-2, Xn,—-1, X,- Continuing this process we 
finally obtain an identity 
L2_, + L2 = m(x?_, + x?) (2.15) 


with L,_ , and L, rational linear forms in x,,_ , and x,,. Taking x,,_ ; and x,, as 
nonzero multiples of the denominators in L,_, and L,, we can consider 
(2.15) as an equation in nonzero integers. From this it follows that m itself is 
the sum of two integral squares 


m= a? + b?. (2.16) 


Conversely, given (2.16) and n = 2 (mod 4) we can define a matrix C of 
order 2 
a b 


fis ee Ἂ (2.17) 


and taking A as the direct sum of n/2 copies of C we shall have 
AA* = A'A=ml. This completes the proof of part (ii) of the theorem in- 
cluding the converse. Thus all parts of the theorem are proved. 


3. Rational Completions and Orthogonal Equivalence 


Let A be a square matrix of order n and let X be the r by n matrix 
consisting of the first r rows of A so that 


A= bd (3.1) 


where W is the n — r by n matrix consisting of the last n — r rows of A. Let us 
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suppose further that every row of W is orthogonal to every row of X. In 
matrix form this is the condition 


XW =0. (3.2) 
Then 
X Xx™ xwt D, 0 
T= xT = re Nie ΞΕ 
ἝΩ Μ᾿" a pal ti ἢ hd ΕΘ ΟΣ 
(3.3) 


The case in which we are particularly interested is that in which AA‘ = ml,. 
Here 


XXT=D,=ml,, WW'™=D,=nl,.,. (3.4) 


The completion problem in which we are interested is the following: 
Given an r by n integral matrix X such that XX‘ = ml, can we find ann — r by 
n matrix W such that A = [Χ] satisfies AA‘ = nl,? 

If W is rational, we say that A is a rational completion of X and an 
integral completion if W is integral. 

Providing that the conditions of Theorem 2.1 of the preceding section are 
satisfied it is immediate from a result of the author and H. J. Ryser [2] that 
any X satisfying XX‘ = ml, has a rational completion. We quote Theorem 
2.1 of that paper. 


Theorem (Hall-Ryser) Suppose that AA’ = ἢ, ® D,. Here the matrix 
A is of order n and nonsingular. The matrix D, is of order r and D, is of order 5 
where r+s=n. Let X be an arbitrary r by ἢ matrix such that XX" = D,. 
Then there exists an n by n matrix Z having X as its first r rows such that 
ZZ' = D, ®D,. This result holds for all fields F of characteristic + 2. 


Theorem 3.1 Suppose that X is an r by ἡ integral matrix satisfying 
XX" =ml,, and that m and n satisfy the conditions of Theorem 2.1. Then 
there is ann by n matrix Z having X as its first r rows and having its last n — r 
rows rational such that ZZ* = μι]... 


This follows directly from the Hall-Ryser theorem, taking F as the rat- 
ional field D,=ml,, D,=ml,_,, and A a solution of AA'T= 
ml, = ἢ. Ὁ ὃ. which exists since the conditions of Theorem 2.1 are 
satisfied. 

The following theorem is an easy generalization of Theorem 2.2 of the 
Hall-Ryser paper. 


Theorem 3.2 Suppose that AA' = D, ®D, where A is of order n and 
nonsingular, and D, and D, are of order r and n—r and are nonsingular. 
Suppose further that X and Y arer byn matrices such that XX" = YY" = D,. 
Then there exists an orthogonal matrix U of order n such that XU = Y. This 
result holds for all fields F of characteristic different from 2. 
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Proof From the Hall—Ryser theorem there exist n by n matrices R and S 


such that 
X Y 
R = ‘ ΞΞΞ 
mh τίη 
with W,, Μὰ n—r by n matrices such that 
RR'=D, @®D,, 551] =D, @D,. (3.5) 
Putting U = R™'S it follows from (3.5) that UU™ =] or that U is ortho- 
gonal. Then 
I, 0 I, O 
n= [τ 8 08 
or 
XU=Y, (3.7) 


which is the conclusion of the theorem. 
4. Results on Integral Completions 


Theorem 4.1 Suppose that m and n satisfy the conditions of Theorem 2.1. 
Then if r=n—1 or n—2 and X is an integral r by ἢ matrix satisfying 
XX* = ml, there is an n by n integral matrix A having X as its first r rows 
such that AA‘ = ml,,. 

Proof For r=n-— 1, the result is easy. Let A be a rational completion 
having X as its first n — 1 rows, whose existence is given by Theorem 3.1. 
Then in fact A is an integral matrix. 


A=[a,], AAT=ml,, A'A=mil,. (4.1) 

The last follows since mA~ 1 = A’, and yields 
Yaz=m jH=l,...,n. (4.2) 
But aj ;, a2;,.--, d,— 1; are integers as entries in X. Hence aj, is an integer but 
as ας; is rational it follows that a,; is an integer for j = 1, ..., n. Thus A is an 


integral matrix and so is an integral completion of X. 
Now suppose r = n — 2 and let Z be a rational completion of X as given 
by Theorem 3.1. Then 


ZZ'=ml,, Z'Z=ni,. (4.3) 


Here the first n — 2 rows of Z are integral, and the last two are rational. Let 5 
be the smallest positive integer such that sZ = A, is an integral matrix. Then 


A, ΑἹ = AiA ΞΞΞ s*ml,, . (4.4) 
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Here if X = [a,,],i=1,...,n—2,j=1,...,, A; has the shape 


SQ14 SQ, j SQin 
SQ 71 562) 5625 
Α, -- : : : (4.5) 
SQy-21 Sn - 2] SQn—2n 
Xi Xj Xn 
Yi yj Yn 


where all entries are integral. Clearly if s = 1, then A, is an integral com- 
pletion of X. We shall show that the least positive s giving a matrix such as 
A, is s= 1. From 414; = s?ml,, we have 


s*(aj, t+ t+az_a)+xpt+yj=s’m j—l,...,n 
5 (α,)αχκ ἘΠ᾽" + Ag—2j4n— rn) + XjXe + γι =O; 75 Κ. (4.6) 
Let p be a prime dividing 5. Then (4.6) gives 
x? + y?=O0(mod p”), xX, + yj yx = 0 (mod p’), 
καῖ, νιν, ας, jek (47) 
Then if p = 2 or p = 3 (mod 4) it follows from (4.7) that 
xX;=y,;=O0(mod p), j=1,...,n. (4.8) 


In this case ρ΄ !A, = A, is comparable to A, with s = ps, replaced by s,. We 
may suppose that p|n but that p does not divide every x and y in A,. Choose 
j so that p does not divide both x; and γ;. Then from (4.7) p divides neither x, 
nor y;. 


x} + yj =O (mod p’), x; #0(p), y; #0 (p). (4.9) 


As p $2, p $3 (mod 4) it follows that p is a sum of two squares and is not a 
prime in the Gaussian integers 


p=a’?+b?=(a+ bi)(a — bi). (4.10) 

From (4.9) 
(x; + y;i)(x; — y;i) = 0 (mod(a + bi)*(a — bi)’. (4.11) 
Since x;, y; #0 (mod p), it follows that x; + y;i is not divisible by both of 
a + δὶ ἀπά a — bi and also that x; — y;i is not divisible by both of a + bi and 
a — bi. Let us suppose the notation such that (x; + y,i, a — bi) = 1. Then 


(a + bi)? = (a? — b? + 2abi) = d + ei divides x; + y,i, where 42 + 62 = p?. 
Then 
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with w, and z, rational integers. It follows that 
(d — ei)(x; + y,i) = (dx; + ey,) + (—ex; + dy,)i = p?w; + p?z;. (4.13) 
Now consider for k # j the identity 
(xj X4 + Yj Ve)? + (Xie -- χα» ") = OF + YFKE + Ve). (414) 
From (4.7) we have x? + y?, xf + yg, XjX_ + yj y, all multiples of ρ΄ so that 
from (4.14) x;y, — x,y; is also a multiple of ρ΄. 
Now consider the product 


(x; — yji)(Xn — Yet) = jy + ViVn) + (—Xje + χα») = peu + prvi. 
(4.15) 
Since (x; + y,;i,a—bi)=1 and p? = (a+ bi)’(a— bi)’ it follows that 
(a — bi)? = ἃ — ei divides x, — y,i. Hence for the conjugates ὦ + ei divides 
x, + y, i. Thus 
Xx + Vz = (d + ei)(w, + z,i) (4.16) 
with w, and z, rational integers. Then 
(d — ei)(x, + Yai) = (dx_ + γι) -- ex, + dy,)i = P?wy + p°Zi. (4.17) 
Now write s = ps, and define the matrix M, by 


pl,—2 
M,= | ------- bon =-- == (4.18) 
t de 
'-ρ d 
It is immediate that M, M} = p7I,,. Using (4.13) and (4.17) for k # j, we have 
ΡΙ,..,} ps, X 


-...--.---.-.-.μ-.-.---.-.......- 11} el 


d ellx, xX; X, 


M,A, 


p7s,X 
dx, tey, °°: ax;+ey, τ“ dx, tey, °°: dX, + Vp 
—ex; +dy, ++ —exj+dy; τ —ex,tdy, τ: —ex, + dy, 
p*s,X 
᾿ 2 2 
pw, Ρ Wj pW, pW, 
pz; px; Pp 2, ΡΖ, 
δι Χ 
- Gildea ae et = p’A,. (4.19) 
Wi one W; eee n 
Ζὶ Ζ; Zz 
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Here A, is an integral matrix with the properties of A,, but with s= ps, 
replaced by the smaller s,. Hence the smallest s appearing in such an A must 
be s=1 and in this case A is an integral completion of X, proving our 
theorem. 


Theorem 4.2) [fm and nare integers satisfying conditions of Theorem 2.1 
and if X is anr by n integral matrix satisfying X X* = ml,, thenifn < 4 there 
is always an integral completion A of order n with X as its first r rows such that 
AA' = ml,. 

Proof This is essentially a corollary of Theorem 4.1 which covers all 
cases with n < 4 except for r = 1, n = 4. But in this case taking X as the first 
row of a matrix B as in (2.8) we have an integral completion of X. 


5. Integral Completions for Hadamard Matrices 


A Hadamard matrix H = [h,,], i,j = 1, ..., n 1s a square matrix with 
every entry h;; = +1 such that 
HH'=nl,, HH =nl,. (5.1) 


These two conditions are equivalent as both of them assert that nH~' = Η“. 
It is well known that for a Hadamard matrix n = 1,2 orn = 0 (mod 4). The 
natural question here is whether or not given an r by n matrix X with every 
entry +1 such that 


XXT =n, (5.2) 


does there exist a Hadamard matrix H with X as its first r rows? 

Leaving the cases n = 1, 2 as exercises for the reader, we shall assume 
ἢ =0 (mod 4) so that the conditions of Theorem 2.1 are automatically 
satisfied. 


Theorem 5.1 Let X be anr by n matrix, n = 0 (mod 4) with every entry 
+1 or —1 satisfying 5.2. Then if r=n-—1,n—2,n— 3, orn—A4thereisa 
Hadamard matrix of order n with X as its first r rows. 


Proof Let H, be a rational completion of X. Then H,H! = 
Στ, Vitghy=0, ῥκοεῖ, ν,η, f#k (53) 
{Ξ 1 {Ξ} 


It follows that 


Y μόπη--δ᾽βῆεξη -- τ, jHl,....n (5.4) 
t=1 


t-=r+1 
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and 


Y hijhe= -- Mhyjhe = Σ᾽ + 1=r (mod 2). (5.5) 
t=r+1 t=1 t=1 

In finding a completion for X it is clear that we may permute the rows of 
X or change signs of some of them without altering the final rows of the 
completion. Also if we permute the columns of X and change the signs of 
some of them, if we perform the same operations on the columns of the final 
rows, we shall still have a completion. Since these operations take integral 
completions into integral completions and are reversible, we may use them 
freely in deciding whether or not a specified X has a completion. 

If Y and Z are two rational g by n—r matrices with YY' = ZZ' a 
nonsingular matrix, then by Theorem 3.2 there exists a unitary matrix U of 
order n—r such that YU = Z. Taking transposes we have 01 = ΖΤ, 
Hence we may define H* by 


eal aa 


Then H? is also a rational completion of X and if the first g columns of W 
are the matrix Y", then the first g columns of U'W are the matrix Z!. This is 
very useful since in the cases we shall consider we can replace rational 
columns Y' by integral columns Z‘ and obtain an integral completion. 

First, if r=n— 1 from (5.4) in the rational completion H, we have 
hi, =1,j=1,...,n, and soh,; = +1,j=1,..., n, and thus H, = H is the 
completion which is a Hadamard matrix. 

If r= n— 2 write H, in the form 


xX 

Bea ee 5.7 

‘a ee (5.7) 
» »2 Yn 


Here (5.4) and (5.5) give 
xot+yp=2, ,-1,..., ἢ, xX t+ yy =2s, JH (5.8) 


where 2s is some even integer and since (x; + x,)” + (y; + γι) Σ 0 we have 
—2 <2 XjX_, + Vi V, <2. 

By Theorem 3.2 as x? + y? = 2 we can apply (5.6) to replace H, by a 
completion H¥ in which (x,, y,) = (1, 1), and without loss of generality 
assume this to be the case in H,. Thus from (5.8) with j # 1 and k = 1 


χ᾽ ἘγΞ 2, x,t yj = —2,0,2. (5.9) 
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Ifx;+ y; = 2, then 
(x; — 1)? + (ν, - 1)? =2-44+2=0 (5.10) 
and x; = 1, y; = 1. Similarly if x; + y, = —2, then 
(x; + 1)? + (y;+ 1? =2—-4+2=0 (5.11) 


and x,;= —l, y,;= —1. 

Ifx; + y; = +2 for all j, then from (5.10) and (5.11) the last two rows are 
identical. This is impossible since the inner product of the last two rows is 
zero. Hence for some j we have x; + γ; = 0 and then 


Thus without loss of generality, we may take x, = 1, γ) = —1. Now H, has 
the form 
Χ 
=|... 12 
ΠῚ 1 1 ᾿ Xj ᾿ Xn (6 


Here for j > 2 we have from (5.9) with k = 1 and k = 2 
xptyp=2, x +y;=0,+2, x,-y,=0,+2. (5.13) 


Here the only possibilities are (x,, y;) = (1, 1), (—1 —1), (1, -- 1), and 
(—1, 1). Hence H — H, has every entry +1 and so is a Hadamard matrix. 
We might also have appealed to Theorem 4.1 and taken H, as an integral 
completion of X and then x? + y}? = 2 would give x; = +1, y; = +1, and so 
H, is an Hadamard matrix. | 
Now take r = n — 3 and write the rational completion H, in the form 


xX 
Hes | See ese 
ΧΙ, Xj Xn 
yi yj Yn 
24 Ζ; Se 


Here (5.4) and (5.5) give 
xj + yj + 27 = 3, XjX_ + VIVE Ἔ 252, = =; =. 1, 3, 
Rk=l1,....n, f#k (5.14) 


since x;x;, + yj), + 2,2, 15. an odd integer in absolute value at most 3. 
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Since xf + yj + z{ =3 by Theorem 3.2 we may apply (5.6) to replace 
(x1, Y1, Z;) by (1, 1, 1), and assume this to be the case in (5.13). Then from 
(5.14) with j # 1 and k = 1 we have 


xptyft2az=3, x, +y,+2;= --βλ, -1, 1,3. (5.15) 


If x;+yj;+2,;=3, then (x,;— 1)? +(y;- 1)? (7; -- 1)? =0 and 
(x;, yj, Zj) = (1, 1, 1) and similarly if x; + γ᾽ + z; = —3, then (x;, »;» 2;) = 
(—1, —1, —1). Hence unless x; + y; + z; = +1 for some j, the last three 
rows are identical, which conflicts with their inner products being zero. 


Without loss of generality take j = 2 and x, + y, + z, = 1. Then defining Y 


and Z by 
1 1 1 1 1 1 
Υ̓ -Ξ- : L= ‘ 5.16 
Ν yo i 1 ἢ ( ) 
we have 
ἣν ἢ 3. 1 
YY" =z, Lc= 1 4} (5.17) 


Hence from Theorem 3.2 there is a rational orthogonal matrix U with 
YU = Z. Hence applying (5.6) we have a rational completion H* with 
(x1, ¥1, Ζ4) = (1, 1, 1) and (x2, y2, 22) = (—1, 1, 1) which we may assume 
to hold in H,. 

Then with j > 2 and k = 1 and 2 (5.14) becomes 


x?+y?4+727=3, x, +yj+72,= 41, +3, 
=X jar 2; = τ 1, + 3. (5.18) 


We note that if the inner product of two columns of the last three rows is 
+3, then the columns are identical, while if the inner product is —3 then 
they are negatives of each other. If this held for every j > 2, then the last two 
rows would be identical, a conflict. Thus for certain 7705 


X;+y +2, =6, = +1, —xXj+tyjt2Z,=6= 41. (5.19) 
If δι ΞΞ 8χ, then x; =0; while if ¢, = —é,, then x, = €,. Hence every x,, 
j=1,...,n is Ὁ or +1. But since xj + +--+ χ =n, the alternative x; = 0 


cannot arise and x; = +1,j = 1,...,n. We can now add the row x,,..., x, to 
X to get X’ and by our previous proof complete X’ with two more rows to a 
Hadamard H. But we may also proceed directly taking j = 3,2) = 1 without 
loss of generality. Then with 


1 1 1 1 1 1 
Y=]-1 1 1 and Z={-1 1 1 (5.20) 
X3 V3 23 1 1-1 | 
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we have 
3 1 1 
yyt=zz'=|1 3-1 (5.21) 
1-1 3 
and with U = Y~!Z apply (5.6) to replace (x3, y3, 23) by (1, 1, — 1). Here 
xX 
H,=|1-1 1 xj χρί, (5.22) 
1 1 1 Vi Yn 
1 1-1 Zy 2s 


and we assert that H, is a Hadamard matrix. For we have for j > 3, 
x? + y? + z7 = 3, 
x, ty, +2,= £1, +3, —xX,;+yjt+2,= +1, +3, 
xjtyj—2,;= +1, +3. (5.23) 


If we have +3 in any one of these cases, the jth column is equal to one of the 
first three or its negative. There remains 


Xjtyj—2j=€3 = ἘΠ. (5.24) 
As observed earlier we must have 8, = -- ὃ) and x; = 8;. Similarly, ife, = €3, 


we have z;=0; while if ¢, = —e3, we have z; = δ). And since we must 
exclude z; = 0, we have ὃ) = —&, 83 = —&,, and 


(x;,y;,2)=(1,-1,1) or (—1, 1, -- 1} (5.25) 


Hence every x, y, and z is +1 and the completion of (5.22) is a Hadamard 
matrix. 

The case with r = n — 4 is more difficult to handle: We write a rational 
completion H, as 


xX 
HA, = Xi ed Xj ca Xn . (5.26) 
γ᾽ ele yj ca ΣΝ 
21 2; Zn 
Wi, Wj γν, 


From (5.4) and (5.5) we have 


x} + Yj + ZF πῇ Ww; = 4, XjX_ + Vi Me + 252, + WW, = 0, +2, +4. 
(5.27) 


132 | MARSHALL HALL, JR. 


We subdivide the proof into two subcases. In the first subcase we assume 
that only the inner products 0, +4 arise. We may apply Theorem 3.2 and 
(5.6) to replace (x1, yy, 21, W,) by (1, 1, 1, 1). If the jth column has inner 
product +4 with the first column, it will be the same as the first column or 
its negative. Since the last four rows are not identical, there will be a j for 
which x; + y; + z; + w;=0, which we may take to be j = 2. Then from 
Theorem 3.2 and (5.6) we may replace the first two columns by (1, 1, 1, 1) 
and (1, 1, —1, —1). Since the last two rows cannot be identical, there must 
be a column whose inner product is not +4 with either of these, and by our 
assumption this means a column whose inner product is zero with both of 
these. Without loss we may take this to be the third column and by Theorem 
3.2 and (5.6) replace (x3, y3, 23, w3) by (1, —1, 1, —1). Now H, is in the 
form 


X 
H,=]1 1 1 ee ca Πςς (5.28) 
1 1-1 Yi °° Vn 
i-1 1 Ζ; Ze 
i -1 --Ἰ Wj γν, 


Now we assert that H, is a Hadamard matrix. For x} + yj + 2? + w? =4 
and if this column has inner product +4 with one of the first three, it is equal 
to that column or its negative. But if it has inner product zero with all three 
of the first, then we find (x,, y;, z;, w;) = (1, —1, —1, 1) or (-1, 1, 1, —1) 
since x; = —y,; = --Ζ; = w; from the vanishing of the inner products. Hence 
in the first subcase there is a completion that is a Hadamard matrix. 

In the second subcase there will be an inner product +2. Without loss we 
may take this to be +2 and the inner product of the first two columns. 
Hence by Theorem 3:2 and (5.6) we may take H, in the form 


X 
A,=]1-1 Χο. ἌΧ (5.29) 
1 1 yj τ Yn 
1 1 Z; Ze 
1 1 Wj Wr, 


Now let us change the signs of rows of X to make the first column consist 
entirely of +1’s. Since the inner product of the first two columns of H, is 
zero, writing n = 4t we find in X 


21 — 3 rows, 1, 1,..., 21 — 1 rows 1, —1,.... (5.30) 


In X we shall subdivide these rows according whether the entry in 
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column j is +1 or —1 


column 1, 2, j 
arows 1 1 1 
brows 1 1 -ἰ (5.31) 
crows 1 --| 1 
drows 1 —1 ~1 


From (5.30) we have 
a+b=2t-3, c+d=2t-1. (5.32) 


We now calculate the inner product of the jth column c; with the first and 
second columns c, and c,: 


(c,,c))=a—b+c—d+x,+y,+2,;+w, =0 
(c2,c)=a—b—ct+d—-—x,;+y,;+2,;+w,=0 (5.33) 
It now follows that 
xj=—c+d=2t-—1-2e¢. (5.34) 


Hence x, is an odd integer and as x} + yj + 2? + w7 = 4 that necessarily 
x; = +1. We may now adjoin the row of x’s to X to form X’ ann — 3 by ἡ 
matrix of +1’s and by our previous case there is a Hadamard matrix H that 
is a completion of X’. This completes the proof of the second subcase for 
r =n— 4 and so all parts of the theorem. 


It is not true that every X with entries +1 satisfying (5.2) can be 
completed to a Hadamard matrix. We shall show what some of these X’s 
are. There is, up to equivalence of permuting rows and columns, and chang- 
ing signs of rows and columns an unique Hadamard matrix H, of order 4: 


1 11 1 

Heed 1--]. (5.35) 
Nes aes eee 
Late. ἢ] 


If we take the tensor product of e, = (1, ..., 1) with ¢ 1’s, t being odd, with 
H, we have an X, 


Xa=fderd der 1 -bee τὸ -ἰ- - (5.36) 


where X is a 4 by 41 matrix such that X XT = 411,. 
Since H, in (5.35) is normalized so that first column consists entirely of 
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+ 1’s and since the columns are orthogonal, the sum of the entries in every 
other column is zero. Hence the sum of the four rows of H, is the vector 
(4, 0, 0, 0). Correspondingly the sum of the four rows of X is (4e,, 0, 0, 0). 
Consequently, a vector (x, ..., X;,---, X4,) in which every x; = +1 cannot 
be orthogonal to all four rows X since then it would be orthogonal to their 
sum (4e,, 0, 0, 0) and then in (x,, ..., x,) half the entries would have to be 
+1 and other half —1. But with t odd, this is impossible. In the same way 
the tensor product of e, with any Hadamard matrix H,,, of order 4m will give 
an X that is 4m by 4mt which cannot be completed to a Hadamard matrix 
H4m since the existence of a completion of X is independent of the normali- 
zation of H,,,. 
In the light of these observations the matrix A (5.37) is of interest. 


1 ft 21 241 1 21 tf 1 +21 ~=421 ~«1 
1 ft 21 1 1-1-1 -1-1-!1-!1 
1 i-i-1-1 1 11 1-1-1 -1 
1 i1-1-1-1-1-I1-1 
—2 0 1-1 0 1-1 0 
0-2-1 1 0-1 1 O 
2-2 1- 0 1-1 0 
0 0 1 0-1-1 0 1 
0 0-1 0 1 1 0-1 
1 

1 


(5.37) 


0 0 1 O-1-1 0 
0 O-1 2-1 1-2 
0 0-1-1 2-1-1 2 


Here AA™ = ΑΤΑ = 12] so that although the first four rows X of A are 
such that they cannot be extended to a Hadamard matrix, nevertheless A is 
an integral completion of X. 


ΞΘ  Φ Φ ΦιιυοΟ Ὦ μὰ κ = = 
GCOOoOONNGCO SO} 

| 

NO 
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Lie Algebraic Proofs 
of Some Theorems on Partitions 


J. W. B. HUGHES 


QUEEN MARY COLLEGE 
LONDON, ENGLAND 


An analysis is given of the layer structure of fundamental representations of the simple Lie algebras, 
expressions being given for the multiplicities of the layers in terms of the partitions (j; l, i), 
P(j; n, n), and Q(j’; a, b, c). A general theorem due to E. B. Dynkin concerning the layer structure 
is then used to derive certain properties of these partitions; a typical example is the result 
P(j; n,n) > P(j — 1; n, n) for j < [πίη + 1)]. 


1. Introduction 


In a previous paper [1] a proof was given of a theorem on the partition 
function (j; n — i, i) of an integer j into at most n — i integers none of which 
exceeds i, namely that, for 0 <j < [i(n — i)/2], j; n—i, i) > G -- 1; n—i, i); 
the connection of this theorem to the layer structure of fundamental re- 
presentations of the Lie algebra A(n — 1) was also mentioned. The proof was 
unfortunately quite erroneous,{ and in fact properties of this nature appear 
to be quite difficult to prove using conventional means (see for instance 
Chowla [2, unsolved problem 68] and Szekeres [3, 4]). However, by a rever- 
sal of the argument, known properties of the structure of these representa- 
tions could be used to give a Lie algebraic proof of the above theorem. The 


1 am indebted to Professor G. E. Andrews of Pennsylvania State University for pointing 
this out to me. 
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theorem was in fact already proved by Elliott [5] using invariant theory of 
quantics, but by considering the layer structure of representations of the 
other simple Lie algebras, new theorems concerning different partitions can 
also be derived. 

It is the purpose of this paper, therefore, to give the analysis of the 
fundamental representations of A(n — 1), B(n), C(n), and D(n), and in parti- 
cular to calculate the multiplicities of the jth layers of these representations. 
In all cases they are found to equal certain partition functions, the above 
(j; n — i, i) for A(n — 1), the partition function P(j; n, n) of j into at most n 
strictly decreasing integers none of which exceeds n in the case of the nth 
fundamental representation of B(n) and the (n — 1)th and nth fundamental 
representation of D(n). For the cases of the other fundamental representa- 
tions of B(n), C(n), and D(n), these multiplicities are given in terms of the 
partition function Q(j’; a, b, c) which denotes the number of partitions of 7 
into exactly a strictly increasing integers none of which exceeds ἢ and in 
which c of the integers contribute negatively. 

Using a theorem by Dynkin [6], which states that these layers are 
“spindle shaped,” five theorems, each in two parts, are derived. The first part 
of each theorem is trivial and can be derived very simply, in the first two 
cases from elementary properties of the appropriate generating functions, 
and in the other cases from the obvious relation 


Q(j'; a, ὃ, c) = Ο(-- 7": a, b, a — c). (1) 
The second parts of the theorems, on the other hand, do not appear to be 
easily derivable using conventional methods and are, as far as the author 15 
aware, with the exception of Theorem 1b, all quite new. Theorem 1 concerns 
(j;n—i, i); Theorem 2 concerns P(j; n,n); and Theorems 3-5 concern 
Q(j'; a, ὃ, c). 

Before giving these Lie algebraic derivations, in order to place the results 
of this paper in perspective we show how some elementary properties of the 
above partitions can be derived using their generating functions. For (j; n — 
i, i) the generating function is the Gaussian polynomial 


n (Egy) oe ies. προων 
N= oh a τ π- (j; n — i, ἢφ (2) 
Seance mae τὰ 
(see Hardy and Wright [7], Rademacher [8]) where one conventionally 
defines [§](q) = 1. The Gaussian polynomials are reciprocal and satisfy [8] 


rio 0) : 


The reciprocal nature of [7](q) will turn out to be precisely the content of 
Theorem la. Also, the use of (3) in (2) gives 


ee [ ) (4) 


)Ξ 
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which, as we shall see in Section 2, states that the ith fundamental represen- 
tation of A(n — 1) has dimension ("). 
The generating function for P(j; n, n) is 


Fee Mig = α Ὁ 4 Ὁ 4): Ὁ 4) 


n(n+1)/2 


= 2 P(j; n, nq’. (5) 


Theorem 2a will turn out to be the statement that this, too, is a reciprocal 
polynomial. Also, putting q = 1 in (5) gives 


n(n+1)/2 n n 
δ P(j;n,n)= ΣΙ ) ΞΕ 7: (6) 
j=0 r=o\l 
which states (see Section 3) that the nth fundamental representation of B(n) 
has dimension 2". 


Finally, for Q(j’; a, b, c) we have the generating functiont 


b a0 
[]Qd+xq+xtq")= Y YY Os;a,}, c)x*t°q’. (7) 
r=1 a,c>0O γ΄)ἑέἑξπ ὦ 

It is worth noting here that while Q(j’; a, ὃ, c) arises naturally in the context 
of this paper, its definition as a partition function is somewhat artificial. 
However, let Q*(j’; a, b, c) denote the number of partitions of j’ with each 
part no larger than b, with no part appearing more than twice, with a parts 
appearing exactly once, and with c integers used as parts. Then 


b+ 1 
of’ - | ᾿ |ib-a b, b—¢} = Q*(j’; a, b, c). (8) 
This is obvious from the following elementary identities: 
b+ 1 
Σ of7 = | : ) b — a,b, b -- c} tg! 


b+1 
= δ : κὺρ >. of a [ 5 ') a, b, c) anata! 


(1 1) b 
- 4. 2 ’xPPT] (1+ x7 tg? + x7 τ 4.) 
r=1 
b 
= [J qxt(l Ἐχ 'φ +x7't719q7') 
r=1 
b 
= [] (1 + xtq’ + tq") =), Οὐ Ὁ": a, δ, οὐχ 4. (9) 
r=1 


t I should like to thank the referee for pointing this out to me. 
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Thus Q is only a superficially altered Q*, and Q* is a very natural partition 
function. 
If in Eq. (7) we put q = 1 and equate coefficients of x*t*, we obtain 


Σ Οὐ"; a, b,c) = [ [2 (10) 


)}κκ-το a Cc 


This result, as we shall see later, may be used to calculate the number of 
distinct weights belonging to shells of fundamental representations of the Lie 
algebras B(n), C(n), and D(n). 

In dealing with A(n — 1) in Section 2, we also summarize, without proofs, 
those properties of simple Lie algebras that are needed in order to calculate 
the layer multiplicities. The Lie algebras B(n), C(n), and D(n) are treated 
more briefly in Sections 3, 4, and 5, respectively. 


2. A(n — 1) 


In this section we consider the case of A(n — 1), which we treat in some 
detail in order to summarize well-known facts about simple Lie algebras. 
Facts valid for arbitrary simple Lie algebras will be given under the heading 
of “Property,” and their proofs will not be given here since they are readily 
available in the literature [6, 9]. We shall reserve the title of “Theorem” for 
properties directly related to partitions. The forms used here for the roots 
and weights of A(n — 1) will be derived, whereas in the following sections 
their forms for B(n), C(n), and D(n) will be written down without deriva- 
tions; all the forms for the roots and weights used here are given in [6]. 

We shall use a realization of the (n? — 1)-dimensional Lie algebra 
A(n — 1) in terms of n x n matrices, choosing as its basis the traceless dia- 
gonal matrices h,, i = 1,..., n, where h; has (n — 1)/n as its (i, i)th element 
and — 1/n for its other diagonal elements, and the e,,,j # k = 1,...,n, where 
e,, has 1 for its (j, k)th element and 0 for its other elements. The h,;, which 
span the Cartan subalgebra of A(n — 1), satisfy the relation h, Ἔ 
h, = 0, so only ἢ — 1 of them are linearly independent. We find it convenient 
to use the above linearly dependent set as opposed to a linearly independent _ 
subset since they give rise to extremely simple forms for the roots. These 
forms will turn out to be n-tuples, like the forms for the roots of the other Lie 
algebras, and it is for this reason that we consider A(n — 1) rather than A(n). 

The commutation relations of the h;, ej, are 


[h; , hj] = 9, ᾽ν 7 ΞΞΕῚ (11) 
[hi , ej] = (δι; — Onde jr » a Dae n, j#ek=1,..., n, (12) 
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and 
h; - h, 9 ἰ = k, m =] 
C jim > =k, m#j 
Cn, Cim| = 13 
[ein im Sige: 1+k, m=j ( ) 
0, l#k, mFj. 
Using the shorthand notation ἢ = (h,, ..., h,), we may write (12) as 
h, 6κ] = jk © jx (14) 
and the first of Eqs. (13) as 
[ein » xj] = On "ἢ (15) 
where 
1 j k n j k 
aj, = (0,...,0, 1,0,...,0, —1,0,...,0) = (1, —1). (16) 
The n(n — 1) n-tuples a, are the roots of A(n — 1). Clearly a,; = —a,, and 


Oj, ἄμ = 2; they also lie on the (n — 1)-dimensional hyperplane 
Χι Ἔ Ἔχ, ΞῸ, 


so only ἢ — 1 of them are linearly independent. We shall choose a basis for 
this hyperplane shortly. 


Definition 1 (a) A real nonzero n-tuple L is called “positive” if its first 
nonzero component, reading from left to right, is positive; otherwise it is 
called “negative.” 

(Ὁ) If Land L’ are two real n-tuples, L is said to be “higher” than L’ if 
the n-tuple L — L’ is positive. It is easy to see that a, is a positive root if 
j <k, and that ifa,, and @,,, are positive, o ,, is higher than αἱ, if either j < |, 
or j=l and μι « Κ. 


Definition 2 A positive root is called “simple” if it cannot be expressed 
as the sum of two positive roots. 


Property 1 The simple roots of a simple Lie algebra form a basis for the 
Lie algebra’s root space, the other roots being linear combinations with 
integral coefficients of the simple roots. 


Definition 3 The number of simple roots is called the “rank” of the Lie 
algebra. 
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A(n — 1) has rank ἡ — 1, its simple roots being a; =4;,;41,i=1,..., 
n — 1, as can easily be verified. An arbitrary positive root a; is given in 
terms of them by 


k-1 
Oh jx = Σ, αὶ a (17) 
i=j 


Definition 4 The group generated by the reflections in hyperplanes per- 
pendicular to the roots of a simple Lie algebra is called the “Weyl group” of 
the Lie algebra. 


Property 2 The Weyl group leaves the set of roots of a simple Lie 
algebra invariant. 


Let L = (1, ..., |,,), and S,,L be the n-tuple obtained from L by reflecting 
it in the hyperplane perpendicular to @;,, 50 


Oy, = L— (aj, * Ljop - (18) 


Using Eq. (16) for «,;,, we obtain 


k 


j 
δ = (1,, ΜΠ Pace li, 219 Ι,} 


so δὲ just permutes the jth and kth components of L. The Wey] group for 
A(n — 1) therefore consists of all permutations of the components of L and 
so is isomorphic to the symmetric group S(n). This clearly leaves invariant 
the set of roots of A(n — 1), and in fact any two roots may be connected by at 
most two reflections: a), = δ Sink % jx - 

We now consider the classification of irreducible representations of 
A(n — 1): Let D be an N-dimensional irreducible representation of A(n — 1) 
with representatives 


D(h;) = H;, D(C jx) = Εκ 
satisfying the hermiticity conditions 
H}=H;, Eh = Eyj- (19) 
These are precisely the conditions required in order that the exponential 
pon D yields a unitary representation of the unimodular unitary group 


We denote the carrier space for D by R and choose its basis to consist of 
simultaneous eigenvectors of the H;: 


Η,Ψ, = HipWp> i=l1,....n, p=1,..., N. 
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Denoting M, = (1p. ---» Unp), this equation may be written in the compact 
form 


Hy,=M,v,, p=1,...,N. (20) 


M, is called a “weight” of the representation D; we denote the set of all 
weights of D by W(D). Because of the hermiticity of the H; (with respect to 
some inner product on R), elements of W(D) are real n-tuples; furthermore, 
from H, +--+ H, =0, we obtain μι, ἘΠ ἜΜ,» Ξ0, p=1, ..., N, so 
elements of W(D) belong to the root hyperplane. 


Definition 5 The “multiplicity” of a weight A ε W(D), denoted y,, is 
the number of linearly independent corresponding basis vectors € R. 


Definition 6 A weight A is “simple” if y, = 1. 


We employ the same method of ordering weights as we did for arbitrary 
n-tuples, and thereby allow consideration of the highest weight of an irredu- 
cible representation D. 


Property 3 The highest weight of an irreducible representation of a 
simple Lie algebra is simple; and if two irreducible representations have the 
same highest weight, they are equivalent. 


The problem of classifying irreducible representations of a simple Lie 
algebra is therefore equivalent to the problem of classifying its highest 
weights. 


Property 4 Any two weights ε W(D) differ by an integral linear com- 
bination of the roots and therefore also, by Property 1, by an integral linear 
combination of the simple roots. 


In particular, therefore, given the highest weight A of W(D), any other 
weight € W(D) may be obtained by a suitable subtraction of simple roots 
from A. 


Property 5 If Me W(D) and α is a root of the Lie algebra, then 
2(α - M)/(a - @) is an integer, and SsM = M — (2(@ - M)/(a - α))α ε W(D). 
This implies that W(D) is invariant under the action of elements of the Weyl 
group. 


Definition 7 Two weights are called “equivalent” if they can be ob- 
tained from each other by an element of the Weyl group. The set of all 
weights equivalent to a given weight is called a “shell” of W(D). 


For A(n — 1), therefore, the shell containing a weight M consists of all 
n-tuples obtained by permuting the components of M. 
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Property 6 If p is an integer such that 0 < p < (a: M)/(a " &), then: 


(a) y(M — pa) = 7(S,M + pa); 
(Ὁ) γί(Μ — pa) > y(M — (p — 1)a). 


This property implies that a chain of weights, M, M-—a,..., 
M — (2(a - M)/(@ - α))α, is spindle shaped, and in particular that equivalent 
weights have the same multiplicity. 


Definition 8 The highest weight in a shell is called a “dominant weight.” 
Clearly the highest weight A is itself a dominant weight. 


For the case of A(n — 1), let M = (μι; ..., μι) be a dominant weight, and 
suppose μ᾽ < μὲ for some j « k. Then 
i k j k 
SM ΞΜ ΞΕ υρ τυ: fy vee) Ξε ρους hes) 


k 


J 
= (0, aang 0, Ly S115: Oh Ness 0, hj; — p, , 9, ..49 0) 


which is clearly positive; since δ} Μ is equivalent to M, this contradicts the 
assumption that M is a dominant weight. Hence if M is a dominant weight, 
its components satisfy the inequalities μὲ > μῃ Σ ΤΠ > Μμ,. 

Now let M be an arbitrary weight ε W(D); by Property 5 applied to 
A(n— 1), M- ay, is an integer for all ἀμ. So μ᾽ — 4 is an integer for all 
j,k =1,..., n. This means that all components of M have the same frac- 
tional part, and also, by Property 4, that the components of all weights 
€ W(D) have the same fractional part. 

The next property, due to Freudenthal [9], enables one to calculate the 
multiplicity of an arbitrary weight. 


Property 7 Let a be a positive root of a simple Lie algebra, 6 half the 
sum of all the positive roots, and A and M the highest weight and an arbitrary 
weight, respectively, of W(D). Then 


(M) — 2. 356 Σ y(M + Κα)((Μ + Κα) - a) 
(A +8)- (A +8) —(M+3): (M+5) 


where »(M + ka) = 0 if M + Κα ¢ W(D). 


We now consider the classification of highest weights A of a simple Lie 
algebra, denoting the irreducible representations corresponding to A by 
D(A), and W(D(A)) by W(A). Now since A lies in the root hyperplane, it is 
uniquely determined by the numbers g; = 2A + @;/a; °a@;, where a; are the 
simple roots. Due to Property 5 and the fact that A is a highest weight, these 
are all nonnegative integers. The numbers σι themselves therefore give a 
unique label for the irreducible representations. For A(n — 1) it is easy to 


(21) 
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show that if A = (A,,..., 4,) is the highest weight of an irreducible represen- 
tation, then g; = (A; — A;4,),i=1,...,n—1. 


Definition 9 A nonzero highest weight that cannot be expressed as the 
sum of two nonzero highest weights is called a “fundamental” highest 
weight, and the corresponding representation a “fundamental” 
representation. 


Property 8 The fundamental highest weights are A’, for which σ᾽ = 6;;. 
There are therefore as many fundamental highest weights as there are simple 
roots and can therefore also be chosen as a basis for the root or weight 
space; in fact if A is any highest weight, then A = δὲ σι A‘. 


For A(n — 1) there are n — 1 fundamental weights and corresponding 
representations; suppose A‘ = (A,, ..., A,), then since A’ + a, = 6,; we have 


Ayer =A, = Ain, +1 =°:: =A, 4+ 1, and since A‘ lies in the root hyper- 
plane A, + τ + 4, = 0. Solving for A, gives 1, = (n — i)/n, so 
1 i i+1 n 
Ai = A. oes eh 
n n n n 


Let N(A) be the dimension of D(A); now since the simple roots are a 
basis for the root space, we may write 


A= Σ Ι,α; (22) 


where the sum extends over all simple roots (i = 1, ....n — 1 for A(n -- 1) 
and i = 1,..., n for the other simple Lie algebras). In general, of course, the ]; 
will differ from the g; above. Also, an arbitrary weight Με W(A) may be 
written 


ΜΈΣ, m; αἱ» (23) 


50 
Λ-ΜΕΣ (I, — m)a;, 


which simply expresses the fact that M may be obtained from A by the 
subtraction of a suitable number of simple roots. Clearly, then, ἰ; — m; must 
be a nonnegative integer. Let 


μίΛ, M) = pa (J; — mj), (24) 
so (A, M) is the total number of subtractions of simple roots needed to give 


M from A. Let A’ be the lowest weight ε W(A); if A = (A,, 42,..., A,), then 
for A(n — 1) clearly A’ = (A,, ..., 42, A;). 


Definition 10 The “height” T(A) of D(A) is defined by T(A) = μ(Λ, A’). 
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Definition 11 A “layer” of W(A) is a subset w,(A) consisting of all 
Με W(A) such that u(A, M) = 


w (A) therefore consists of all M obtainable from A by the subtraction of 
j (not necessarily distinct) a roots. — the number of layers of 
W(A) = T(A) + 1 and W(A) = (J 728 a,( 

Let s,(A) denote the ef of a : co,(A), in which each weight is 
sera) 7 number of times equal to its multiplicity. Then N(A) = 
YT’ s(A), and because of the simplicity of A and A’, we have 


So(A) = ὙΠῸ = 1. 


Property 9 The layers οἵ W(A) are spindle shaped, ie. 5} (Δ) Ξ 
Stay- (A) and so(A) < s,(A) Ξ """ <s,(A) where r = [57(A)] (the largest 
integer <5T7(A)). 


It is this property that we shall apply to the fundamental representations 
to derive our theorems on various types of partitions of integers. 

We now consider the case of A(n — 1) and show that s,(A‘) equals the 
partition (j; n — i, i) of j into at most n — i positive integers none of which 
exceeds i. Now 

1 i ΪΈῚ n 
Ai =(a,...,a,a—1,...,a—1) where a=(n-—i)/n, 


and the only simple root that may be subtracted from it is a;, and this may 
be subtracted only once to give 
1 i-1 i i+ 1 V+ 2 n 
A'—a;=(a,..., a,a—1, a,a—1,...,a—1) 


which is equivalent to A‘. Next, the only simple roots that one may subtract 
(again just once) from A‘ — a; are ας 1. again to yield weights equivalent to 
A‘. Repeating this process it is easy to see that if Με W(A’), the only 
permissible subtraction of a simple root from it is one which interchanges an 
a— 1 with an a immediately preceding it, this being achieved by a single 
subtraction. This shows that every weight ε W(A‘) is equivalent to A‘, and 
therefore simple. The dimension of D(A‘) therefore equals the number of 
distinct permutations of the components of A’, and this equals the number of 
ways of choosing i components to be a, Le., 


N(A‘) = () (25) 


Next, T(A‘) equals the number of subtractions of a simple root from A‘ to 
obtain the lowest weight 


1 n-i n-itl1 n 


Δ΄ =(a—1,...,.a—1, a ,...,a). 
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Clearly, A" is obtained from A’ by i interchanges of the first a — 1 (reading 
from the left) with the preceding a, followed by i similar interchanges of the 
second a — 1, and so on. Each step in the process involves the subtraction of 
a single simple root, so since there are n — i (a — 1)’s, we have 


T(A‘) = i(n — i). (26) 


Finally, we calculate 5 (Λ΄); since each weight of W(A’‘) is simple, this 
equals the number of weights € w,(A’‘). We must therefore calculate the 
number of distinct ways of subtracting j simple roots from A‘, in other words 
the number of distinct ways of successively interchanging j (a — 1)’s with 
preceding a’s. Suppose, for ἰ = 1, ..., n — i, the Ith a — 1 undergoes Κι such 
successive interchanges; clearly the (/ + 1)th a — 1 cannot have more such 
interchanges than the /th a — 1, and since there are i a’s, no a— 1 may 
undergo more than i interchanges. s,(A‘) is therefore the number of distinct 
solutions of the equation Κι +---+k,_;=j subject to the conditions 
Κι >k,>°°:>k,_; and Κι <ifor!=1,...,n— 1. But this is just the parti- 
tion function (j; n — i, i); hence 


s,(A‘) = (j; n — i, ἢ. (27) 


Equations (25) and (26) and Property 9 now imply the following proper- 
ties of (j; n — i, i): 


per ἃς τὸ n 
Σ, G3 n-i, i) = ("). 
j=0 ; 

which is just the Eq. (4) trivially derived using generating functions in Sec- 
tion 1, and 


Theorem 1 

(a) (); n—i, i) = (i(n— ἢ —j;n—i, ἡ); 

(Ὁ) (7: -- ἰ, ἢ) 5 () -- 1, -- ἱ, i) for 0 <j < [i(n — i)/2]. 
Theorem 18 is, as mentioned in Section 1, merely a statement of the recipro- 
cal nature of the Gaussian polynomial [7](q). Theorem 1b, on the other hand, 
is not trivial and although it was in fact proved a long time ago by Elliott [5] 


using invariant theory of quantics; it does not appear to be very widely 
known. 


3. B(n) 


Having in the preceding section given the general method for deriving 
the simple roots and fundamental weights of a Lie algebra, we shall content 
ourselves in this and the following sections with writing them down without 
derivation (these can all be found in [6]). B(n) is the Lie algebra of the group, 
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O(2n + 1) of (2n + 1) x (2n + 1) orthogonal matrices, and has rank n. Its 
positive roots are n-tuples of the form 


j k 
(0,...,0,1,0,...,0+1,0,...,0) and (...,0,1,0,..., 0), 


the simple roots being 
i itl 


a; = (0,...,0,1, —1,0,..., 0) for i=1,...,.n—l, 


and a, = (0, ..., 0, 1), so 2(α; - L)/(@;-a;)=a;°L, i= 1, ..., n—1, and 
2(α, ° L)/(@,, * &,) = 2(a, ° 1. Using 2(Λ' - α;),Κι,ῳὺα; ° «,;) = 6;;, we find the 
fundamental highest weights to be 

1 i iti n 


Ai=(1,...,1, 0,...,0), i=1...,n-1, and  A"=(,..., 4). 


We treat the nth fundamental representation separately from the rest, deal- 
ing with it first. 

By Property 5, the only simple root that may be subtracted from A" is @,,, 
this being subtractible only once to give A" — a, = (3, ...» 4, —4), which is 
equivalent to A". a, ; may be subtracted once from this to give the weight 
(4,...,4, —4,4), and from this, one may either subtract a, _ ) to give (3, ..., 7 

—4, 4,4), an a, again to give (4, ..., 4, —3, —%). It is easy to see, in fact, that 
an arbitrary weight Με W(A") has either 4 or —% for its components, ἡ 
subtraction of a simple root being equivalent to ether Miterchanging a—4 
with an immediately preceding 4 or to changing an nth 4 into a —4. At no 
stage can a simple root be subtracted more than once, so each eight is 
ecuivaient to A" and therefore simple. Clearly, the lowest weight is 
A” = (—4,..., -4} 

Now the number of weights with | —4’s is (1), so the dimension of D(A") 
is given by 


N(A*) = "ἢ. )=2. (28) 


--- 


Also, in order to obtain A” from Δ" one must change the nth 3 into ἃ —4 and 
then permute it step by step with the omediately preceding 5 until one 
obtains, after n simple subtractions, (—4, 4, ..., 3). Repeating the a it for 
this weight one obtains, after a further n — 1 simple subtractions, (— ὁ, —3, 3, 

., 4) and so on. By the time A” is attained, the total number δὲ ‘simple 


subtractions undergone is n + (ἢ — 1) τ τἘ 2 Ἐ1, 50 
T(A") = ἐη(π + 1). (29) 


Next, s,(A”) is the number of distinct ways of suptscung j simple roots 
from A"; suppose, for some M € w,(A”), the Ith — ὦ is in the (n — k, + 1} 
position, where the —4’s are numbered from the left and we interpret a —4 
in the (n + 1)th σον δα as a 4 in the nth position. The replacement of ἃ 4 in 
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this position by the —4 is achieved by Κι simple subtractions. Clearly, no Κι 
can exceed n, and k, > k,,, unless they are both zero; therefore j = k, + 


“+ +k,, and 
s(A") = P(j; n,n), (30) 


where P(j; n,n) is the number of partitions of j into at most n strictly 
decreasing integers, none of which exceeds n. 
Equations (29) and (30) now imply that 
n(n+ 1)/2 
P(j; n, n) = 2" 
j=0 

which is just Eq. (6) derived using the generating function of P(j; n, n); and 
Property 9 implies: 


Theorem 2 


(a) P(j; n, n) = PGn(n + 1) —j; n, n); 
(Ὁ) P(j; n, n) > P(j — 1; n, n) for j < [πη + 1)]. 


(a) is, again, merely a statement of the reciprocal nature of the generating 
function for P(j; n, n); (b) appears to be new. 

We now consider the general case of A‘, i#n. By the subtraction of 
simple roots a;, i # n, we obtain all weights whose components are permuta- 
tions of those of A‘, and so which have i components equal to 1 and n — i 
components equal to zero. These weights are all equivalent to A‘ since at any 
given time a given a; may be subtracted only once. For any such weight with 
1 for its nth component we may also subtract a, either once to give a weight 
with 0 for its nth component, or twice to give a weight with — 1 for its nth 
component. The latter weight is equivalent to the original weight and there- 
fore also to A’, whereas the former weight is not. 

Proceeding in this manner we see that W(A‘) consists of all weights with 
a number <i of components equal to + 1, the rest of the components being 
zero. For any such weight a, may be subtracted once if the ith component 15 
1 and the (i + 1)th component is zero or if the ith component is zero and the 
(i + 1)th component is —1, but if the ith component is 1 and the (i + 1)th 
component is — 1, a; may be subtracted either once to give a weight with two 
more zero components or twice to give a weight with the 1 and —1 inter- 
charged. If the nth component is 1, αν, may be subtracted either once to give a 
weight with one more zero component or twice to give a weight with the nth 
component equal to — 1. Weights with the same number of + 1’s are equiva- 
lent and so belong to the same shell. Let G, denote the shell consisting of 
weights with i — k components equal to + 1 and n — i + k zero components; 
clearly 


W(A') = UG. (31) 
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The lowest weight 


1 i n 
A’ =(-1,..., -1,0,...,0)€ Go, 


and the dominant weight of G, is 
1 k n 


M, = (1, sey 1,0, gO) 


We use M, to calculate the multiplicity of weights € σι, using Freudenthal’s 
formula, Eq. (21), and denoting y(M,) by N, for short. First, using 
ὃ = 4(2n — 1, 2n— 3, ..., 3, 1) we obtain (A‘ + 5)- (A‘ + δ) — (M, + δ): 
(M, + δὴ = k(2n — 2i+k +1). Secondly, n—i+k positive roots of the 
type (0, ...,0, 1, 0,...,0) may be added to M, once only to give a weight of 
G,— 1, whereas (n — i + k)(n — i + Καὶ — 1) positive roots of the type (0, ..., 0, 
1,0,...,0, +1, 0, ...,0) may be added once only to give a weight of G,_ 2. In 
each such case, (M, + α) α Ξ (α " α), so substituting into Eq. (21) we 
obtain the recursion relation 


k(2n — 2i+k + 1)N, = 2(n-—i+k)2(n—it+k— 1)Ny-2+ Ny-1), 
k>1, 


where N_, = 0. Using Νὺ = 1 we obtain easily N, = 1; the solution of the 
recursion relation subject to these conditions 15 


n—i+t 2k n-i+2k+1 
N= | k ) N 2K +1 - k ) (32) 


Consider G,; weights in this shell have (n — i+ k) zero components 
which may be in any of (;",) distinct arrangements. For any one of these 
arrangements, suppose there are j 18 and (i — k — y) —1’s; there are (';*) 
such arrangements of the +1’s, so the total number of weights with 
(n —i+k) 0’s in any given arrangement is 

> - ᾿ ΒΤ 

j=o\ J 
Hence the number of distinct weights in G, is (;",)2'~* and the multiplicity of 
G, is N,(;",)2'~*. Summing over all shells we obtain the dimension of D(A‘); 


this is given in Table 30 of [6] as (25 1), which is therefore the value of the 
above sum: 


N(A‘) = y νι n J+ = | 
k=0 i—k 
We next calculate the height of W(A‘); let 


=1,...,n-—1. (33) 


oe 
j " 
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so that Αἰ = Ao, A” = Αἰ; now 


j 
Ai = Ai_, —2(0,..., 0, 1,0,...,0) = Αἱ, — 2; ++ +.,), 


so Aj is 2(n — j + 1) layers down from Αἰ... Hence Aj is 2(n + (n — 1) + 
-++(n—i+ 1)) layers down from Aj, 1.6., 


T(A‘) = i(2n — i + 1). (34) 


We now consider an arbitrary weight of G, and determine to which layer 
it belongs by calculating how many layers down it is from M,. M, itself can 
easily be shown, in a manner analogous to that used to calculate T(A’), to 
belong to ὠκ(2-- 2i+k+1)/2(A'). Let M,(I,, ..., |;-,,) be the weight ε G, with 
i — k 1’s in which, for A = 1, ..., i— k, the Ath 1 is, reading from the left, in 
the I,th position (so 1 <1, <1, <--: <1;_, <n). To get to this weight from 
M,, the Ath 1 must be moved forward 1, — A places, each such movement by 
one place being achieved by a single subtraction of a simple root. M,(Ij,..., 
l;_,) is therefore 


i-k 


Y (L, -Σὰ- k)(i —k + 1) 


A=1 
layers down from M,. 
Next, let Mii". mn be the weight with (i — k) + l’s of whichh(0 <h< 
i — k) are —1’s such that the 0’s are in the same position as in M,(I,, ..., 
l;-,), and such that, ior w=1,..., h, the wth —1 is in the l,, th sae 
where 1 <m, « πιχ « "" « κι, Ξ ἰ-- Κ΄ Now since it takes 2(n — I,, + 1) 
simple pea to ἘΠ ὦ 1 in the /,,, th position by a — 1, to get from 
M,(i;, ..., 1;-,) to Mee ὙΠῸ, requires 


h 
2F (n—b, $1)=2hin + ΠΣ |, 
w=1 nw=1 


simple subtractions. The number of simple subtractions needed to get from 
M, to Min aod is therefore 


Yh-23 4(i — k)\(i — kK + 1) + 2h(n + 1) 


= Σ Hl — Hi — γι — b+ 1) + 2h(n + 1) 


where h of the δὲ are — 1 and the rest are + 1. Finally, therefore, Mim» τοῦ σὰ 
is On w;, where 


i—k 
j= δ᾽ Ohl, + (n+ 1)(2h +k) —4i(i + 1). 
A=1 
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Let Q(j’; a, b, c) be the number of partitions of j’ into exactly a strictly 
increasing positive integers none of which exceeds b and in which c of the 
integers contribute negatively. From the above it is easy to see that 

i-k 


SA‘) = ΣΝ, Σ Οὐ — (n+ 1)(2h + k) + fifi -Ὲ 1); i—k, n, h). (35) 


Using Eq. (34) we see that 
i(2n-—it1) i-k 
δ O(j — (n+ 1)(2h + k) + 3i(i + 1); i—k, n, h) 
j=0 h=0 
is the number of distinct weights ε G,. We have already seen that this 
number is (;",)2'~*, and the equality of these two expressions can easily be 
shown using Eq. (8), with the appropriate values for j’, a, b, and ὁ (jnin and 
i'nax are easily seen to correspond to j = O and j = i(2n — i + 1), respectively). 
Clearly, Eq. (10) could have been used to derive the formula for the number 
of distinct weights of G,, but no great advantage would have ensued from 
doing so. 
Property 9 of the last section shows the following to hold: 


Theorem 3 Fori=1,...,n—1, 
i i—k 
(a2) YN, Σ Οὐ — (n+ 1)(2h 4+ k) + ϊ{{ + 1); i—k, Ἢ, h) 
k=0 h=0 


i i-k 
= YN, Σ Oli(2n-—i+ 1)—j—(n+ 12h +k) + 5ifi + 1); i—k, π, h). 
k=0 h=0 


i ick [0(] — (n + 1)(2h + k) + ΜΓ + 1); 1 -- Κα ἡ, h) 
(Ὁ) ΣΝ Σ ἡ ρ(1-- 1 — (n+ 12h + k) + 4iGe + 1); 1 -- kmh) 


for 0<j <4i(2n—i+t 1). 


>0 


As mentioned in Section 1, (a) follows easily from the property 
Q(j’; a, ὃ, c) = Ο(-- 7": a, b, a— c) when the appropriate values of j’, a, ἢ, 
and c are taken, and in fact the equation holds with the δ᾽ -ο N;, omitted so 
that the shells themselves have a symmetric layer structure. This is a fact that 
does not follow from Dynkin’s theorem; the author does not know whether 
it is generally known. No previous proof of (b) appears to have been given. 


4. C(n) 


C(n) is the Lie algebra of the group S,(2n) of symplectic matrices of order 
2n, and again has rank n. Its positive roots are again n-tuples of the form 


j k 
(0,..., 0, 1,0,...,0, + 1,0,..., 0) 
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(like B(n)) and (unlike B(n)) 


the simple roots being 


i i+1 


a,=(0,...,0,1, —1,0,...,0) and a, =(0,..., 0, 2); 


here 2(a, - L)/(a, - α,) = 3(a, - L). Using 2(A‘ + a,)/(a,; - αἰ) = δι; we find the 
fundamental highest weights to be 


fori=1,...,n. 

The analysis of D(A‘), i= 1, ..., n, is very similar to that of D(A‘), i = 1, 
..,n— 1 of B(n), the weights again having either +1 or 0 for their compon- 
ents. However, a, may be subtracted only once from a weight with 1 as its 
nth component, to give a weight with — 1 for its nth component, so a weight 
cannot have its number of zero components increased by one. Whenever a 
weight has a | immediately preceding a — 1, it is still possible to subtract an 
a;, i #n, twice, one subtraction resulting in a weight with two more zero 
components. The shell structure of W(A‘) is therefore similar to the case of 

B(n), except that only the G,,, k =0, ..., [i/2], occur, 1.6. 

τς {{|2] 

W(A‘) = U Gr. (36) 


k=0 


Using the dominant weight 


1 : 
M,, = (1,..., it , 0, ..., 0) 


to calculate the multiplicity of weights of G,,, and the expression 6 = (n, 
n—1, ..., 2, 1), we find that this multiplicity is ((n—i+ 1)/(n—i+ 
k + 1))N,,, where N,, is as given in Eq. (32). This implies that G,, has total 


dimension 
(n—i+1) [n—i+2k nN \>i-2 
(n—i+k+1) k i — 2k 


The total dimension of D(A‘) is therefore 


᾿ς ΕΣ δ j= ()-((4) 
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Letting 


j 
Ai=(-1,...,-11,..., 1,0,..., 0), 


we find that for C(n), Aj= Δ’, -- 2(α; Ἑ "- Ἑ α,- 1) -- α,, so Aj is 
{2(n -- j) + 1} layers down from Αἰ; 4; using this we find that the height of 
W(A‘) is given by 

T(A‘) = i(2n — i). (38) 


With the same notation as in the preceding section, and by highly similar 
methods, i can easily be gs that an arbitrary weight of the type 


Ma Ty» Where Κ = 0,...,[i/2] and h = 0,...,i — 2k, is on the layer W;, 
where now 
i— 2k 
j= Σ᾽ O41, + (2n4+ 1)(h +k) -- Hii + 1) 
A=1 
and hence 


(3 any roe i- 2k 
Es 
zo (n —i +k +1) i= 


'χ Οὐ — (2n + 1)(h + k) + Zi(i + 1); i — 2k, n, h). 
Property 9 of Section 2 implies the following: 


S(A‘) = 


Theorem 4 For i=1,...,n, 


[i/2] (n — ἱ - 1) i— 2k 
(a) A,a-i+k+1)* ΡΣ 


x O(j -- (2n + 1)(h + k) + ΣΙ(ἰ + 1); i — 2k, n, h) 
κοι (n—i+1) i- 2k 
i= ,(n—-itk+1) 5 py 
x QO(i(2n — i) —j — (2n + 1)(h + k)+ Ζ{{{-Ὲ 1); i — 2k, n, h). 


{i/2] (n—i+ 1) i— 2k 
—_—_____“___ J 
Ξοίη --ἰἰ ἘΚ - 1) an 2 
: Ο() — (2n + 1)(h + k) + ἐξ + 1); i — 2k, n, h) 
—Q(j -- 1 -- (2n 4 1)(h + k) + 4i(i + 1); i — 2k, n, h) 


>0, for 0<j <[5i(2n — ἢ]. 
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As in the case of B(n), (a) holds with the k summation omitted, as can 
easily be proved using Q(j; a, b, c) = Q(—/j; a, b, a—c), so that the shells 
themselves are layer symmetric, whereas (b) appears to be new. 


5. D(n) 


D(n) is the Lie algebra of the group O(2n), and has rank n. Its 
positive roots are n-tuples of the form 


k 
(0,..., 0, 1,0,...,0, +1,0,..., 0) 


and the simple roots are 


a,=(0,...,0, 1, —1,0,..., 0), isn 
and 
ἃ = (ΟΣ ρον Ὁ 151), 


The fundamental weights are easily found to be 


AY = (1,225.1, 05412; 0) for i=1l,...,n—2, 

A" ' =(§,...,3, -$) and A"= (4,..., 4). 
W(A"~*) and W(A") are rather similar in structure to W(A”) of B(n), both 
having dimension 2" * and height (n(n — 1)/2). 5,(Δλ" 1) and s,(A”) are both 
equal to P(j; n -- 1, n — 1), so application of Property 9 yields Theorem 2, 
with n replaced by n— 1. Nothing new is therefore obtained from their 
analysis. 

For i <n — 2, W(A’‘) has a similar structure to W(A’) of C(n), with shells 
Gy,,k =0,..., [i/2], consisting of all n-tuples with n — i + 2k zero compo- 
nents. Freudenthal’s formula, with 6 = (n — 1, n — 2, ..., 1, 0), implies that 
weights of G,, have multiplicity N,, given in Eq. (32), so here G,, has total 


dimension 
n—i+2k n ni- 2k 
k i — 2k 


Summing over all shells, we obtain the following expression for the total 
dimension of D(A‘): 


W21/y — j + 2k n 2n 
A! ΞΞ i— 2k _ i= ΠΥ --- Ζ. 
Ν(Δὴ ΣΙ k Ἱ Ν xx)? ( ) Ξε 1....,η -- 2 
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An analogous calculation to those used for B(n) and C(n) leads to 
T(A‘) = i(2n — i — 1) (40) 


and an arbitrary weight of G2, of type Μίκων k = 90,..., [i/2], h = 9, 
, i — 2k, is found to be on the layer @, where here 
i— 2k 


j= Σ OL, + 2n(k +h) — di + 1) 
A=1 


SO 


[i/2] P= 2k 


s(A‘)= ΣΧ ΝΣ Οὐ — 2n(h + k) + aii + 1);i- 2k, n,h) (41) 


Property 9 implies the following: 
Theorem 5 Fori=1,...,n— 2, 


{i/2] i— 2k 
(a) YNy Y Oj —2nlh +k) + 4i(it 1); i — 2k, n, h) 
k=0 h=0 
[2] i—2k 


= Nox Σ Olin — i= ἢ - —j —2n(h +k) 


=0 κΞ 
ἘΞ + 1); i— 2k, n, h) 
(b) Nog 2k [0(] - 2n(h + k) + 3i(i + 1); i - 2k, ἡ, h) 
mK |-O(j — 1 — 2n(h + k) + Ziff + 1); i — 2k, 1, h) 


>0, for 0O<j<zi(2n —i-— 1). 


Again, using Q(j; a, b, c) = Q(—j; a, b, a — c), (a) could be proved very 
easily even with the ) {2} N>, omitted, showing the shells themselves to be 
layer symmetric, whereas (b) appears to be new. It should be stressed that in 
(b) of Theorems 3-5, neither the k nor the h summations may be omitted 
without invalidating the formulas, as simple counterexamples will soon 
verify. 

To summarize, using one theorem concerning representations of the 
simple Lie algebras, five theorems concerning partitions have been obtained, 
the part (a) of the theorems being trivially derivable using usual methods, 
but part (b) being nontrivial. On the other hand, using elementary properties 
of partitions we have seen that the symmetric part of Dynkin’s theorem 
actually holds not just for the representations but also, in the case of the 
fundamental representations of A(n), B(n), C(n), and D(n), for the shells of 
the representations. 
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The question as to whether new properties of partitions might ensue by 
applying Dynkin’s theorems to arbitrary irreducible, or even reducible, re- 
presentations of the simple Lie algebras, including in this case also the 
exceptional Lie algebras, might be worthy of further investigation. 

I should like to thank Professor G. E. Andrews of the Pennsylvania State 
University for encouraging me to write this paper, and for several extremely 
helpful communications. 
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1. 


For s =o + it, L(s) stands throughout this paper for L(s, 7,, 4), given for 
a>1by 


= χει, 2 (-1}" 
a ees ‘ai 


In our paper [1] we showed that if the Riemann-Piltz conjecture holds for 
L(s), 1.6., 


L(s)#0 for o>3, (1.2) 
then for suitable explicitly calculable positive constants A,, A,, and A, the 


inequality 


» (-- 1) 112 log p exp(—4, log? 2) < ~A,,/x (1.3) 


p>2 


holds for all x > A;. This fact gives great interest to the problem to find 


+ Deceased. 
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explicitly long sequences of consecutive primes P,, P,+ 1, Py+2>-++5 Py+_ With 
Py = Py41 ='°* = Pytp = 1 mod 4. (1.4) 


The longest such sequence below 10° was found by Den Haan in Eindhoven 
consisting of 11 terms; it would be very desirable to find longer such se- 
quences, below 103, say. 


This problem suggests a number of exciting questions if we add the 
qualification “infinitely often.” The simplest such problem is obviously that 


concerning 
def 


f(x) = » I (2.1) 
pen τὰ 
if x —-> oo. When asked about the possibility of finding an elementary proof of 
the relation 
lim f(x) = +00 (2.2) 


x? 6) 


some years ago, Erdos was definitely very pessimistic about it. As remarked 
by Ingham on the last page of his book [2], Littlewood’s method for proving 


Him (x(x) - [" al |- + 00 


x7 a log r 


gives—with the usual notations—mutatis mutandis 
lim {x(x, 4, 1) — a(x, 4, 3)} = +00 


x a 


lim {2(x, 4, 1) — x(x, 4, 3)} = — οὐ, 


x7 


which clearly implies (2.2) (and nothing more). Let c stand throughout this 
paper for positive explicitly calculable numerical constants, not necessarily 
the same in different occurrences. Then we are going to prove the following 


Theorem For T > c, the inequality 
f(T) > log” T 
holds with positive explicitly calculable constant B. 
For f3(x) defined by 
f(x) = » 1 (2.3) 


Py-2=Py-1 =Py=1 (4) 
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no proofs are known even for 


lim f3(x) = oo. 
x 0c 
It seems to be hopelessly difficult at present to prove for arbitrarily large 
prescribed ὦ the existence of one sequence p,, Py+1, --+» Py+@ With 
Py = Pvt = *** = Pyt+o = I (4). (2.4) 


This first lower bound for f,(T) in our theorem is seemingly not very 
strong, though the (small) exponent B could be considerably improved by 
a more careful treatment. Much more interesting seems to be our guess that 
the events 


Py = Pyv+1 = 1 (4), Py = Py+1 = 3 (4) 
py=1(4) Py+1 =3 (4), py =3 (4), Pye = 1 (4) 


are “not equally probable.” It would be of definite interest—and perhaps 
feasible in the not too distant future—to prove 


fils) = of 2 ) (2.5) 


log x 


We shall mention a further problem of localization later. 


Now we turn to the proof of our theorem. In our paper [3] we proved the 
following (see Knapowski and Turan [3, p. 201]). Suppose that in the strip 
|t] <zlog’’° T the function L(s) has at least one zero in the 
parallelogramt 


a>44log-"2°T, — |t| <4 log!°T. (3.1) 
Then we have for T > c, numbers U,, U2 with 
(2 <) T exp(—2 log’?"° τ < U, < U, <T, (3.2) 


so that the inequality 
(—1)- 9? log p> ΟΣ (3.3) 
U,<p<U2 


holds. Let us call a p, = 1 (4) a “bad” one if p,, , = 3 (4) and a “good” one if 
Py+1 = 1 (4). Associating each bad p, with its p,,, the contribution of 


+In (9.1) of Knapowski and Turan [3] the inequality |r| <7 reads correctly 
|t| <4 log’!° T. 
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(log p, — log p,, 1) being negative can be dropped from the left-hand side of 

(3.3). The same holds for the contribution of the p, = 3 (4) which were not 

associated with a p,_, = 1 (4). Hence from (3.3) it follows in the case of (3.1) 
{f(U2) —f(Ui)} log Uz > U2 


and a fortiori 


f(T) > = U2 > ./T exp(—3 log!*/1© ΤῊ (3.4) 
log U, 


for T > c. This is much stronger than the inequality of the theorem. 


4, 


In this first case (3.1) we got also the “local” theorem that the interval 
[T exp(—2 log'*/'© T), T] (4.1) 


contains a pair (p,, py.1) with p, = p,+, (4). This was noticed already in 
Knapowski and Turan [3]. We have now to investigate case II which— 
owing to the functional equation—means that the roots of L(s) in the strip 
|t| <3 log’/!° T are contained in the narrow parallelogram 


[σ —4| <log™ 129 T, Jt} <4 log’? T. (4.2) 


For this case we stated in Knapowski and Turan [3] that for T > c and 
suitable 


log; T < U, exp(—log!?"® U,)< U, < U,<T (4.3) 
the inequality 
Σ (-1)?? log p> ΟΣ (4.4) 
U;<psU2 


holds. The proof of (4.3}-(4.4) in case II was on p. 201 of Knapowski and 
Turan [3], postponed to the English edition of the book of the second author 
since it was rather long and uses appropriately ideas of Littlewood, Ingham, 
and Skewes. This will be indeed so (even with log, T instead of log, T in 
(4.3)), but for our present theorem a much shorter argument can be given 
(based again on ideas of Littlewood, Ingham, and Skewes essentially). 


5. 
Let 


def 
(— 1)" ?7A(n) = A(x) (5.1) 
n<x,n odd 
def 


Σ (-1)°"? log p = A, (x). (5.2) 


2<p<x 
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Obviously, 
A,(x) = A(x) — Σ log p + O(x!? log x) = A(x) — (1 + o(1))/x. 


ps vx 


Hence if for a € > c we have 


A(E) > 3.( (5.3) 
then also A,(¢) > ./é holds and the reasoning of Section 3 gives 
fale) > V/E/(los 6). (5.4) 


Hence we have to produce in case II, i.e., under supposition (4.2) a possibly 
large € < T satisfying (5.3). 


In order to do this we write 
def 
A(e*)e~ 52 = G(9) (6.1) 


and we start with the “exact” formula of Riemann-Mangoldt in the finitised 
form as one can find, e.g., in Prachar’s book “Primzahlverteilung,” p. 228. 
Restricting us to 


log log log T < 3 < τ log log T (6.2) 
(to which we are forced by (4.2)) and denoting the nontrivial zeros of L(s) by 
p= B+ iy (6.3) 
we get from it easily 
p(P-(1/2))9 
G(3) = -- —_-—§ + o(1). (6.4) 
ly] < (1/2) log!/10 T ρ 


Now we can use (4.2). We get 


elP—C1/2))3 pis ele (1/2))$ _ gid 


p ly p 
< : ex | : ) ἢ Ἐπ’ 
= Ty p Ιου 20 T γ} 


owing to (6.2) and (6.4), this gives under (4.2)-(6.2) 


iyd 


<C. (6.5) 


(8). 


ly] Ξ 72) log2 T EY 
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If A is a big constant to be determined later, we form after Ingham the 


integral 
| a y \ {sin(y/2)\? 
H(o, =a) ,Ὁ o+4) y/2 dy (6.6) 


where ὦ is restricted by 
log log log T <w—1<w+i4< τ log log T. (6.7) 


Using (6.6)- (6.5) and completing the integrals to (— 00, +00) we get, using 
also the formula 


ft smb) ἐν dy = [1 for -1<A<+1 
2π y/2 0 for |A| >1 


of Fejér, the inequality 


το 


iyw 


Ho A+ “ 
isa ἢ 
1 Ξ sab) 
<c{1l+ ii] Geom exp(i(y/A )y) dy | . 
| ena r |? Ν y/2 ΠΩ 
(6.8) 
Since the contribution of the terms with |y| <A to the last sum is 
ς 1 log? A 
<— wa<c <c 
A ᾽" I? | A 
and those of the terms with A < |y| <4 log*’’® T after partial integration 
2 [5 + τι] 2 
astisaa roger [7] 11}} 4* 7} 
we get from (6.8) 
H(w, A)+2 δ᾽ pene Co (6.9) 
O<y<A 


where Cy is an explicitly calculable positive numerical constant (1.¢., does not 
depend on A either), whose value matters somewhat. 


We shall need the elegant lemma of Ingham which asserts with 
log? A 


to = y (7.1) 
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the inequality 


1 
i) sin Yoo —} log τΤι 


O<y<sa 7 | οἱ 


(c, is the same sort of constants as co was in (6.9)). 


8. 


Now let A be the smallest constant satisfying the inequalities 


og? A 
further with the constants cy and c, 


Cox<szrlog A, c, <3 log A, 


1 c (a) ee 


2n — A/4 y/2 
and finally 
N(A)>1 
where 
N(Xx)= YL 
O<ysx 


If A* is the value of this A, (6.9)- (8.2) give 


H(a, A*) > —2 


O<y< A* 


Next we choose ὦ (taking care of (6.7)). Putting 
4 = [800zx log 45] 
Dirichlet’s theorem gives the existence of an integer m with 


I — N(A*) 1 
50η 1 log log T <m <5 log log T 


so that for all y’s in (8.4), the inequality 


A 
<i, 2 log; 3, 23 log A, τς log A > 3, 
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(7.2) 


(8.1) 


(8.2) 


(8.3) 


(8.4) 


(9.1) 


(9.2) 
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holds.t With this m we choose 
def 
ω τὸ πὶ τ ἂρ ΞΞ Οὗ. (9.4) 


(9.2)-(8.1) assures that (6.7) is not violated for T > c. Then we have for 
all of our y’s with αὐ = —(log? A*)/A* 


m 
sin yos — sin [να + 2n =| 


es tie *| _ 
|sin yas — sin yo* | ᾿Ξ 


2 
sin γα -- sin( vo + 931) | <—, (9.5) 


and hence using (9.5)-(8.1}-[7.2} (8.2) 


sin γω sin γαὰξ Cc 1 
O0<y< Ax 7 O0<y<A* δ 4 0<y< Ax ly | 
sin yas 
> -- ἐμεῦ —~ τς log A* 


O<y< A* 
> 4 log 4 — οἱ — 760 log A* > (4 — τσ) log A*. 
Then (6.9) gives from (8.2) 
H(w*, A*) > (4 — zn) log A* — ἀξ log A* = (ἐδ — τσ) log A*. (9.6) 

Going back to G(x) this gives, owing to (8.1}-(8.2), 

a y \ (sin(y/2)\’ 

ees κι ἢ d 1 lop A* : 

55. [oo + Fall y/2 y > 7z log A* > 3 


Hence there is an 


w*—-4<awrt*<w* +4 (9.7) 

so that 
A(e®**) exp(—43w**) = G(w**) > 3, 

1.€., 

A(e®**) > 3 exp(3w**). (9.8) 
Putting 

exp(w**) = 

we obtain 


Δ(ξ) > 3,/é 


+ {x} denotes as usual the distance of x from the next integer. 
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and from (5.4) 
fl) > 3/8. (9.9) 
Since from (9.7}-(9.4)-(9.2) for T > c 
E < exp(w* + 4) <2 log'?°* T< T 
and 
E > exp(w* — 4) > log’ T, 
these and (9.9) give in case II 


F(T) > f2(S) > wf > log® T 


with a positive constant B indeed. This and (3.4) complete the proof of our 
theorem. 
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Introduction 


Let A be a commutative ring equipped with an involution J ,. For con- 
ciseness, we write A instead of the pair (A, J,), and denote the ring A 
without involution by |A|. A signature o on A is defined as a homomor- 
phism from the Witt ring W(A) of nondegenerate hermitian forms over A to 
the ring of integers Z. If |A| has a connected spectrum, either A has no 
signatures at all or the kernels of the signatures are precisely all minimal 
prime ideals of W(A), as has been shown in Knebusch [7, I, §2] and Dress [3]. 
It also can be shown that A has no signatures if and only if — 1 is a sum of 
norms xJ,(x) in A (proof to be published, cf. [8] for | A| semilocal). In the 
case that A 15 a field and J, 1s trivial it is well known that the signa- 
tures of A correspond uniquely to the orderings of A[5, 11]. For |A| a 
semilocal ring, a general study of signatures and related topics can be found 
in Knebusch et al. [10] and [8]. 

For a homomorphism φ: A — B into another ring with involution B an 
extension τ of a signature o on A to B (with respect to @) is defined as a 
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signature t on B such that the diagram 


W(A) —*—> W{(B) 
\/ 


with g, induced by φ is commutative. Recently I developed a theory of “real 
closures” of a pair (A, o) with A an arbitrary commutative ring with involu- 
tion and o a signature on A (cf. [7]), which generalizes Artin-Schreier’s well- 
known theory of real closures of ordered fields. This theory 15 tied up with a 
theory of extensions of o for @ finite etale {i.e., |g]: | A|—|B| finite etale}. 
As has been shown in [7] there exist only finitely many extensions of σ to B 
in this case. Moreover the regular trace Trg,, from B to A yields a 
W(A)-linear map , : 


Tr¥,4: W(B) > W(A), 
(cf. (12, 7]) and for z in W(B) the following trace formula holds true [7, I, 83]: 
σ(ττῇ, (2) = Σ n(x)e(2) 


τ|σ 
In this sum τ runs through the extensions of σ to B, and the “multiplicities” 
n(t) are positive integral numbers, uniquely determined by τ and φ. If | A| is 
semilocal and J, is trivial, all these multiplicities are 1 [7, II, §8]. 

Inserting z = 1 in this trace formula we see that the number r of signa- 
tures of B extending o is at most equal to the rank [B: A] of the projective 
A-module B. If |A| is semilocal and J, = id, moreover r= [B: A] mod 2. 

In the present paper we study the extensions of o to B in the case that B is 
a “Frobenius extension” of A. For any element a of a ring with involution we 
denote the image of a under this involution by ὦ. We call B a Frobenius 
extension of A if B is as an A-module finitely generated and projective and if 
there exists a linear form 5: B— A on this module such that 5(6) = s(b) for ἢ 
in B and the hermitian form s(Xy) on the A-module B is nondegenerate. If 
the involutions are trivial, this is indeed the usual notion of Frobenius 
extension occurring in the literature (cf. Eilenberg and Nakayama [4]). We 
also say that s is a Frobenius form on B. 

If B is finite etale, the trace ΤΥ, 4 is a Frobenius form on B. But the class 
of Frobenius extensions is much larger than the class of finite etale exten- 
sions. This is the reason the present paper seems to be a necessary step in the 
theory of hermitian forms over rings. For example, let a be an element in A 
with ἃ = a. The extension B ‘= A[X]/(X* — a) = A[x] with J, extending J , 
and x = x is Frobenius. But if 2 or a is not a unit in A, then B 15 not etale. 
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Certainly such extensions are important for a study of hermitian forms. A 
useful example of a cubic Frobenius extension occurs in [9]. 

We essentially apply the same methods as in [7], cf. in particular §§3 
and 4 of [7], and we use also the terminology and the notations of [7]. Our 
results are satisfactory only in the case that |A| is semilocal. Much work 
remains to be done. For example, we do not know in general whether a 
given signature o of A has only finitely many extensions to a given 
Frobenius extension B of A. As has been shown in [7, §3] this is true if B is 
finite etale over A. 


1. The Transfer Formula 


In this section A is an arbitrary commutative ring with involution. Let 
φ: A—B be a Frobenius extension of A and s: B—> A a Frobenius form. 
Then s induces an additive map 


s*: W(B) > W(A), 


which is defined as follows. Let (E, ®) be a hermitian space over B, i.e., a 
finitely generated projective B-module E equipped with a nondegenerate 
hermitian form ©. (® is assumed to be antilinear in the first and linear in the 
second variable.) Then s* maps the Witt class [E, ®] of this hermitian space 
to the Witt class [E, s ο ®] of the A-module E equipped with the hermitian 
form s o ® (cf. Scharlau [12]). Clearly s* is W(A)-linear, i.e., we have 


s*(p,(x)y) = x " s*(y) (1.1) 
for x in W(A) and y in W(B). 
Now let o be a signature on A. We denote by δίῳ, σὴ) the set of all 
signatures τ of B that extend oa. 


Theorem 1.1 There exists a unique family (m(t)|t € S(@, σ)) of integral 
numbers such that almost all m(t) = 0 and for every z in W(B) the following 
equation holds true: 

a(s*(z)) = Σ, m(t)t(z). 
Here τ runs through the set S(@, σὴ of all extensions of σ to B. (If this set is 
empty, the equation reads o(s*(z)) = 0.) 

Proof The kernel P(t) of a signature τ: W(B)-— Z is a minimal prime 
ideal of W(B) [3. 7]. Thus there do not exist any inclusion relations between 
the kernels of different signatures of B. This already implies that there exists 
at most one family (m(t)|t € S(g, o)) for given ~, o,s with the above 
properties; cf. Knebusch [7, I, p. 72]. 


170 MANFRED KNEBUSCH 


To prove the existence of such a family (m(t)) we choose ἃ homomor- 
phism « from A into a real closed field with involution R inducing o. In more 
explicit terms this means the following. | R| is an algebraic closed field. Jz is 
nontrivial, and thus the fixed field Ry of Jp is a real closed field in the sense 
of Artin—Schreier. We have 


W(R) = W(Ro) XZ 


(Ry equipped with the trivial involution), and denoting the unique signature 
of R by p we have p ° a, = a. Such a homomorphism « exists according to 
[7, 1, 84]. 

The tensor product Β ®, R with respect to g and «, equipped with the 
involution J, ® Jp, is a Frobenius extension of R with the Frobenius form 
5601 from Β @,R to R. It is easily checked that the following diagram 
commutes: 


W(B) ——~> W(B @, R) 


(1 @a)x 
s* | | (s Ὁ 1)» 


w(A) ---α WIR) 


We have a unique direct decomposition 
9 
Β Ba R = [ |B: 
i=1 


into connected rings with involution B;. Let g;: Καὶ > B; denote the compon- 
ents of the homomorphism g @ 1 from R to Β @, R, and α;: B > B; the 
components of the homomorphism 1 @a from B to Β Ὁ, R. We may 
assume that for some r,0 <r <g all rings | B,| with 1 <i <r are connected 
and all | B;| with r < i <g are not connected. Then for i > r, the ring B; with 
involution is isomorphic to a product D; x ἢ, of two copies of a ring D; with 
the “switch” of the factors as involution. Thus W(B;) = 0 fori > r. Fori <r, 
we denote by N; the nil radical of B; and we have 


B; = p(R) ®N;. 


Since 2 is a unit in Καὶ and N, is nilpotent, @; induces an isomorphism @,;, from 
W(R) = Z onto W(B;) (cf. e.g., [14, Theorem 2.2.1]). 

Let s;: B; > R denote the restriction of the R-linear form 5 @ 1 to the 
direct summand Β; of the R-module B @, R. Clearly, 5; is a Frobenius form 
with respect to g;: R— B;. Denoting finally the projection from B @®,R 
to B;, by p;, the induced maps p; from W(B @,R) to W(B;) yield an 
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(Pie, --+> Pre): WB Ὁ, ΚΕ) ——— []W(B 


i=1 
and for u in W(B @, R) we have 


(ὦ 1)*(u) -Σ S¥Dis(u 


Now it is not difficult to prove the desired formula for os*(z), with z in 
W(B). We have 


os*(z) = ρα, s*(z) = p(s @ 1)*(1 Ὁ α), (z) 
= Lost Pie(1 @ a), = Yost On (Z 


Let y; denote the unique homomorphism from B; to R with y; ° g; = id 
(1 <i <r). We have yj. ° @js = id. Since g;, is an isomorphism this implies 
Dix ° Vix = 1d. We obtain 


os*(z) = Σ PSF Qin Vis Oin(Z) = > PSF Pix Bin(Z), 
i=1 i=1 


with B; denoting the homomorphism j; ° a; from B to R. Applying (1.1) this 
can be simplified to 


os*(z = ¥ pst(1) ) > pB;.(z). 


Notice that the B; are precisely all homomorphisms β from the ring with 
involution B to R with β ὁ φ = «. For any signature t in δίσ, @), we denote 
by I(t) the set of all indices i, 1 <i <r, such that p o B;, = τ. Of course only 
finitely many I(t) are nonempty. Further, we denote by B(t) the product of 
all B; with i in I(t). We have 


Poa ae) 


Finally, we denote by s, the restriction of 5 69 1 to the direct summand B(t) 
of the R-module B @, R. Clearly, s, is a Frobenius form on the extension 
B(t) of R. Using these notations we can write 


as*(z) = 2 m(z)t(z) 


tid 


with 
m(t) = ps¥(1). (1.2) 
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In more explicit terms, m(t) is the signature of the hermitian form s,(xy) on 
the vector space B(t) over R. 

This proves our Theorem 1.2 and also gives some insight into the nature 
of the coefficients m(ct). 

The coefficients m(t) depend only on ᾧ and τ. We denote them hereafter 
by n(t, 5). The notation n(t, φ, 5) or n(t, A, s) will be appropriate whenever 
it is not clear from the context which base ring A is under consideration. The 
formula 

os*(z) = δ᾽ n(t, 5)τ(2) 
τ|σ 
will be called the transfer formula for φ: A -- B and 5. 

From (1.2) we deduce in the case that the rank [B: A] of the projective 

A-module 1s constant, the modest information 


2, [πίτ, s)| <[B: A] (1.3) 


Furthermore, 
n(t, 5) = [B(t): R] mod 2. 
Thus if [B: A] is constant, we see that 
δ᾽ n(t, s) =[B: A] mod 2. (1.4) 


tla 
From our proof of the transfer formula also the following corollary is 
evident. 


Corollary 1.2 Let s be a Frobenius form on B with respect to 9: A — B. 
Let o be a signature on A and t be an extension of σ to B with n(t, 5) # 0. Then 
for any homomorphism a: A > R into a real closed field with involution R that 
induces o there exists a homomorphism B: B — R that induces τ and extends a, 


ie, βοφ -τεα. 


In general πίτ, 5) may well be zero (cf. Section 2). We mention two cases 
in which πίτ, 5) # 0 for all extensions τ of o to B. 


Remark 1.3 If @ is finite etale and 5 is the regular trace Tr,,, then all 
n(t, s) are positive numbers. Moreover, given a homomorphism ἃ from A 
into a real closed field with involution R inducing o the number n(z, 5) 
coincides with the cardinality n(t, «) of the set of homomorphisms β from B 
to Καὶ that induce τ and extend « (cf. Knebusch [7, I, §3]). Indeed, in the proof 
of the transfer formula now the ideals N;, 1 <i <r, are zero, and the Fro- 
benius forms s;: B; > Καὶ are the inverse maps of the isomorphisms @;: R -> B;. 
This implies immediately n(t, 5) = n(t, a). 
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If E is an hermitian spacet over A of constant rank n then |o(E)| <n for 
every signature o on A. Indeed, let α: A -» Καὶ be a homomorphism into a real 
closed field with involution R inducing o. Then o(E) is the usual Sylvester 
signature of the hermitian space E @, Καὶ over R. We call E positive definite 
at o if o(E) = n. Returning to our extension g: A > B with Frobenius form 
s: B-> A we denote by s*<1) the hermitian space B over A, equipped with 
the hermitian form s(xXy). 


Proposition 1.4 If B has constant rank over A and s*<1) is positive 
definite at o, then for every signature t on B extending o the coefficient n(t, 5) 
is positive. Moreover, given a homomorphism a from A to a real closed field 
with involution R inducing o we have πίτ, s) = n(t, a). 

Proof Returning to the proof of the transfer formula we see that all 
spaces s*<1>, 1 <i <g, are positive definite over R. Thus r = g. Further- 
more, the ideals N; must be zero since otherwise the highest power N@ of N; 
that is nonzero would be an isotropic subspace of s*<1>. Thus s¥<1> has 
rank one and s*<1) has rank πίτ, «). We obtain 


n(t, 5) = ps¥<{1> = n(t, a). 
We still have to show n(t, s) > 0 for a given extension τ of ¢ to B. We choose 
a homomorphism £ from B into a real closed field with involution καὶ induc- 


ing τ, and we put a ‘=f o g. Then certainly n(t, «) > 0 for this particular τ, 
hence n(t, s)>0. QED 


We now look at what happens to the coefficients n(t, 5) if we replace s by 
another Frobenius form s’ with respect to g. We have 


s'(x) = s(bx) 
where b is an element of B with b = b uniquely determined by s and s’. Since 
the hermitian form 5 (Χγ) is nondegenerate, b must be a unit of B. Let <b> 
denote the hermitian space over B consisting of the B-module B and the 
hermitian form bxy. We denote the class of this space in W(B) again by <b), 
and the value of this class under a signature t on B by 1(b). Clearly 
s'*(z) = s*(<b)z) 
for z in W(B), hence 
as'*(z) = δ᾽ n(t, s)t(<b>z) = δ᾽ πίτ, s)t(b)t(z). 
τ|σ τ|σ 


Thus we have 
πίτ, 5) = t(b)n(z, 5) (1.5) 
for every signature t on B. Notice that t(b) = +1 since <b>? = 1. 


+ We write for brevity E instead of the pair (E, ®). 
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2. Some Examples 


We first study quadratic extensions. Let A be an arbitrary commutative 
ring with involution, and let a be a unit in A with @ = a. We consider the ring 


B:= A[T]/(T? — a). 


We have B = A @ At with the relation t? = a. We extend the involution J , 
to B either by prescribing t = t (case I) or by prescribing t = —t (case II). 
We introduce the A-linear form s on B with s(1) = 1, s(t) = 0. Then in 


both cases s(x) = s(x) for every x in B. Using the basis 1, t of B over A, we 
obtain 


s*<1) = <1, a> (case I), 
SC) ye {!, --αΣ (case IT). 


Thus s is a Frobenius form on B over A. 
Let o be a signature on A. 


Proposition2.1. (i) Assumet = tIf +t o(a) = —1, theno has no extension 
to B. If o(a)= +1, then o has precisely two extensions τι, τὰ to B, and 
n(t,, 5) ΞΞ πίτ;, 5) = 1. 

(ii) Assume t= —t. If o(a)= +1, then o has no extension to Β. If 
o(a) = —1, then o has a unique extension t to B and n(t, 5) = 2. 


Proof If t =t, then a= tt is a norm in B, and a signature o on A with 
σ(α) = —1 cannot be extended to B. If t = —t, then —a = tt is a norm in B, 
and a signature σ with o(a) = +1 cannot be extended to B. In the remaining 
cases s*<1) is positive definite at o. Thus by Proposition 1.4 every extension 
t of o to B has coefficient n(z, s) > 0. Furthermore, 

Σ᾽ n(t, 5) = os*(1) = 2. 


tla 
Thus one of the following two possibilities must occur: 


(A) o has precisely two extensions τι, τ, and n(t,, 5) = πίτ;, 5) = 1. 
(B) o has a unique extension τ, and πίτ, 5) = 2. 


Anyway, o has at least one extension to B, and we choose such an extension 
τι. We further choose a homomorphism f, from B to a real closed field with 
involution R inducing τι. We put « ‘=f, © g, with φ the inclusion map from 
A to B. Then « induces oa. Let y denote the automorphism of B over A of 
order 2, defined by y(t) = —t. Notice that y is indeed compatible with the 
involution J, in both cases. Consider the homomorphism β. ‘=f, ° y from 


+ Cf. the notations at the end of Section 1. 
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B to R. We have βι(() + B,(t) and thus £, is different from f,. Clearly, B, 
and f, are all homomorphisms from B to R extending «. Let p denote the 
unique signature of R. If t = t, then we have a hermitian space <t> over B 
consisting of the B-module B and the hermitian form txy. We have 


(p © βχ.)() = --(ρ ο Bis)XdD. 
Thus there exist two different signature p o B,,and p © B,, extending o, and 
possibility (A) is realized. If t= —t, then 
βχτεῦᾳ ο βι. 


Now J, is an automorphism of the field with involution Καὶ that induces the 
identity on W(R) (cf. [7, I, Lemma 3.11]). Thus p ὁ By. = po By. = τι, and 
we obtain n(t,, «) = 2. According to Proposition 1.4, we have n(t,, s) = 2, 
and possibility (B) is realized. 

If B = A[T]/(T? — a) but a is not a unit in A, the situation may be very 
different. 


Example 2.2 Let A be the localization of the polynomial ring R[X,, ..., 
x,] in n 2 2 variables x; with respect to the maximal ideal generated by x,, 
..., X,, and let πὶ denote the maximal ideal Ax, + --: + Ax, of A. We choose 
the element 


a:= —(x? -Ὁ --- Ὁ x?) 
in mand study the extension 
B=A@ 4, t? =a, 


of A, both A and B being equipped with the trivial involution. The A-linear 
form s on B defined by s(1) = 0, s(t) = 1 is a Frobenius form over A. Indeed, 


0 1 
s*{1>= | | 
1 a 
The ideal IN:= mB is the unique prime ideal of B lying over A, and 
A/m = B/M= ΚΝ. 


We consider the evident signatures attached to m and Wt. 


ao: W(A)> W(A/m) ——— Z, τ: W(B)> W(B/M) —— Ζ 


Notice that τ extends o. Since A is regular, o can also be extended to a 
signature of the quotient field R(x,, ..., x,) of A, equipped with the trivial 
involution, as has been shown by Craven, Rosenberg, and Ware [2]. Thus 
there exists an injection « of A into a real closed field with involution R 
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inducing o. But this homomorphism «a certainly does not extend to a homo- 
morphism f from B to R since a(a) is a negative element of the fixed field Ry 
of Jp, and f(t) would be an element λ of Ry with A? = a(a). In particular 
n(t, s) = 0. 


Remark 2.23 From Theorem 3.1 in the next section it can be easily 
deduced that τ is the only extension of σ to B. 


We now shall study extensions of type B = A[T]/(T"). We have the fol- 
lowing general fact. 


Lemma 2.4 Let φ: Α -- B be a homomorphism in the category of commu- 
tative rings with involution. Let NR, and Mt, denote the nil radicals of A and B 
and A, B denote the reductions A/N, and B/N,. Assume that φ induces an 
isomorphism @: A > B. Then every signature of A extends in a unique way toa 
signature of B with respect to φ. 

Proof We have a commutative diagram 


A en eer 


| 
A == B 


with z and π΄ the canonical surjections. It suffices to prove the lemma for the 
homomorphisms z and π΄. Then it will be evident for g too. Thus we assume 
hereafter that B = A and ῳ is the canonical surjection from A onto A. 

Let o be a signature on A. We choose a homomorphism « from A into a 
real closed field with involution R inducing o. We have a unique homomor- 
phism ἃ from A to R with ἃ ο m = α. Let τ denote the signature on A induced 
by a. Then τ extends o. The remaining assertion that t is the unique exten- 
sion of o now follows from the fact that the map g, from W(A) to W(A) is 
surjective. Since I did not find a reference for this probably well-known fact 
in full generality, I indicate a proof. 

We show that for a given hermitian space (U, h) over A there exists a 
hermitian space (E, ®) over A whose reduction mod 9, is isometric to 
(U, h). There exists a finitely generated projective A-module E, unique up to 
isomorphism, such that E/N, E is isomorphic to U (cf. e.g., Swan [13, 
Theorem 2.26, p. 89]). Thus we assume a priori that U is the reduction E ofa 
given finitely generated projective A-module E modulo 9t,. We now can 
find another hermitian space (V, h’) over A such that U @ V isa free module 
over A. Indeed, let W be a module over A such that U@ W is a free 
A-module of finite rank. Then take 


(V, h’):=(U, —h) L H(W) 
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with H(W) the hyperbolic space W@W* constructed from W. 
(W* = antidual module of W, W Ὁ W* equipped with the obvious hermi- 
tian form which is zero on W x Wand on W* x W*.) We again regard V as 
the reduction mod N, of a finitely generated projective A-module F. The 
module G:=E@ F over A is free. Let g,, ..., g, be a basis of G over A, and 
let J,, ..., Jn denote the corresponding basis of the reduction G = ὕ Ὁ V. 
We now lift the hermitian form h 1 h’ of G to a hermitian form w of G by 
lifting the hermitian matrix of h | h’ with respect to the basis g,,...,g, toa 
hermitian matrix over A in an arbitrary way. Let ® denote the restriction of 
w to E. Clearly, ® reduces mod MN, to the nondegenerate form h on U, hence 
® itself is nondegenerate. Thus (E, ®) is a hermitian space over A that has 
mod 9, the reduction (U, h). 


Remark If A contains an element p with p + fi = 1 (e.g., 2 is a unit in 
A), then by well-known arguments any two hermitian spaces over A that 
have isometric reductions mod M, are themselves isometric. Thus the can- 
onical map from W(A) to W(A) is an isomorphism. We do not need this fact. 


We now consider an arbitrary commutative ring A with involution and 
study the extension B = A[T]/(T") for some n 2 2. We have 


B=A@QA®:@OA"', =O, 
and we extend J, to an involution J, on B by prescribing t = t. 
Proposition 2.5 The A-linear form s on B defined by 
s(l)=s(t)="-=s(t""*)=0, s(t" *) = 1 


is a Frobenius form over A. Every signature σ on A has a unique extension t to 
B (cf. the preceding Lemma 2.4). n(t, 5) = 1 ifn is odd, and n(t, 5) = O if nis 
even. 


Proof We have indeed s(X) = s(x) for x in B. If n = 2m + 1, the space 
s*<1> has the orthogonal decomposition 


m—1 
s*{1) = (Ar") LL (Ati t+ A?™5). 


i=0 


Thus using an obvious notation 
0 1 
s*c1>= <1) L mx Ἷ at 


If n = 2m, we have the orthogonal decomposition 


m-1 
s*{1) = 1 (Ati + AP!) 


i=0 
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and 
0 1 
RIS 
απ ἢ At 


Thus on the Witt ring level s*(1) = 1 ifn odd and s*(1) = 0 ifneven. Let now 
o be a signature on A and τ be the unique extension of σ to B. By the transfer 
formula 


1, n odd 


ee ‘ n even 


3. Integral Extensions of Semilocal Rings 


We now assume that our ring with involution A is semilocal, 1.6., | A | has 
only finitely many maximal ideals. In this case there exists for every signa- 
ture σ on A a prime ideal p(c) of A intimately related to σ, whose definition 
and relevant properties I want to recall (cf. [7, 1, 84 and Appendix Β]). 

Let A, denote the fixed ring of J,, equipped with the trivial involution. 
We introduce for σ a given signature on A the set Γ(σ) consisting of all units 
a of Ay with o(a) = +1, and the set Q(c) consisting of all finite sums 


N(A,)ay ἘΠ t+ N(A,)a, 


with a; in Γ(σ) and “norms” N(A,) = 4,4; of elements 4; of A such that the 
ideal generated by A,,..., 4, is the whole of A. Clearly Q(c) is a multiplicative 
subsemigroup of 49. Let — Q(c) denote the set of all elements — x with x in 
Q(c) and p(c)y the complement of Q(c) ὦ (— Q(a)) in A. The following facts 
have been proved in [7] and in a more special situation already by Kanzaki 
and Kitamura [6]. 


P1 A, is the disjoint union of Q(c), —Q(c), and p(o)o. 

P2 ρίσ)ο is a prime ideal of 40. 

P3 The set of all x in A with N(x) lying in p(c)o is a prime ideal p(c) of A, 
obviously stable under J,, and this is the only prime ideal of A lying over 
p(o)o- 

P4 There exists a unique signature σ᾽ on the quotient field A(p(c)) of the 
ring with involution A/p(c) extending o with respect to the natural map 
from A to A(p(o)). 

P5 (oc) contains every other prime ideal q of A such that q is stable under 
the involution of A and o extends to some signature on A(q). 
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Let σο denote the restriction of o to 40. Since the natural map from W(A)) 
to W(A) is surjective, σ is the only extension of og to A. Clearly (a9) 
coincides with Γίσ). 


P6 Ο(σο) coincides with Q(c) and p(a9) coincides with p(o)o. The natural 
map from A (p(c)o) to A(p(c)) identifies Ag(p(o)o) with the fixed field 
A(p(c))p of the involution of A(p(c)). The signature δ is the unique extension 
of to A(p(oc)). 

P7 I(d,) is the set of all fractions ao” ' with a, τ the images in Ay /p(a)o of 
elements u, v of O(c). Notice that Γ(δο) is just the set of positive elements of 
the ordering of Ag(p(a),.) corresponding to do. 

P8 Every element s of Q(c) can be written in the form 


s=a, + N(A,)a, -Ἑ τ: + N(A,)a, 


with a,,..., a, in Γ(σ) and 22, ..., A, in A. (If Ap has no residue class fields 
with fewer than four elements, then even s = a, + a2 with a,, a, in Γ(σ).) 
P9 I(ca) is the intersection of Ο(σ) and the set Aj of units in 40. 

P10 Q(c) + p(o)o = Qc). 

Pil For x in A but x not in p(a), the norm N(x) lies in Q(c). 


We call p(c) (resp. p(a)9) the prime ideal of A (resp. of Ag) associated with o. 

Let now gy: A > Bbea homomorphism from A to another semilocal ring 
with involution B, and let φο: Ag + By denote the restriction of @ to the 
fixed rings of the involutions. Let further τ be a signature on B and let σ 
denote the restriction t o g, of t to A with respect to φ. 


Theorem 3.1 Assume φ is integral, i.e., every element of B is integral over 
the subring p(A). Then p(c) is the preimage of p(t) under φ, and p(a)o, resp. 
Q(c), are the preimages of p(t)o, resp. Q(t), under Qo. 

Proof p(t) is the unique prime ideal of B lying over p(t), and p(a) is 
the unique prime ideal of A lying over p(c)). Thus it suffices to prove the 
statements about p(c)o = ρίσο) and Q(c) = O(a). Replacing g by go, we 
assume without loss of generality that A and B both have trivial involutions. 

Without any assumption about ¢ it is clear that Q(c) is contained in 
gp 'Q(t) and —Q(oa) is contained in φ΄ '(-- Ο(τ)). Considering the comple- 
ment of Q(c) ὦ (—Q(c)) in A we learn that p(c) contains g p(t). (This also 
follows from the fact that o extends to a signature on B(p(t)) with respect to 
the obvious map.) To finish the proof of the theorem it will suffice to show 
that Q(c) coincides with p~ 'Q(r). Indeed, considering again the complement 
of Q(c) ὦ (—Q(a)) in A, this will imply that also p(o) coincides with 


Suppose there exists an element t in φ΄ 'Q(r) that does not lie in Q(o). 
Clearly, t also does not lie in —Q(oa) since this set is contained in 
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φ΄ '(—Q(z)). Thus t must lie in p(c). On the other hand, we have an equa- 
tion (cf. P8) 

p(t) = δι + AZb, + τ + 22}, (x) 
with b,,..., δ, in Γ(τ) and A,, ..., A, in B. 

We consider the field L:= B(p(t)), equipped with the ordering corre- 
sponding to the signature τ. For b in B, we denote the natural image in L by 
b; and for a in A, we denote the natural image of g(a) in L also by ὦ for 
conciseness. We now make the following observation. Let a be an arbitrary 
element of A. Then 1 — at lies in Q(c) since t lies in p(o) (cf. P10). Thus 
1 — g(at) lies in Q(t), and the element 1 — ἄγ in L is positive. Now b, > 0 
and, according to («), δ, <t. Thus we learn that 


ab, <1 
for every a in A. Introducing the element d:=b,' of B, we have 
a<d (+) 


for every a in A. But this is impossible since B 15 integral over A. Indeed, we 
have an equation 


d" + o(a,)d""' +--+ o(a,) = 0 
with some n = 1 and elements a; of A. This implies the equation 
ὧν ἃ, ἀ 1 +---+4,=0. (x4) 
Consider the element 
c‘=1+6,a,+°''+6&,a, 
with ες = +1 if 4; = 0 and ε, = —1 if 4; < 0. Then in an obvious notation 
ὃ -- 1 -- |a,/+---+ [a]. 


By a well-known lemma the equation (*««) implies} d < é, which contradicts 
(*«x). Thus an element t as above cannot exist, and we have Q(c) = φ΄ ‘Q(t). 
This finishes the proof of Theorem 3.1. 


This theorem has the following consequence which is important in view of 
the preceding sections. 


Corollary 3.2 We assume again that φ: Α -- B is an integral homomor- 
phism between semilocal rings, that o is a signature on A, and τ an extension of 
a to B. Let « be ahomomorphism from A to areal closed field with involution R 
that induces o and has kernel p(c). (Such a homomorphism clearly exists.) If 
the field L:=B(p(t)) has trivial involution or if the field K ‘= A(p(a)) has 
nontrivial involution, then there exists a unique homomorphism B from B to R 


+ In fact, d < Max(1, |a,| +--+ |@,|). 
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that extends α, i.e., a = B © w, and induces τ. In the remaining case that K has 
trivial involution but L has nontrivial involution there exist precisely two such 
homomorphisms B, and B,, and B, =Jro βι. 

Proof « factors through a unique homomorphism ἃ from K to R, and ἃ 
induces the signature on K. The field L is an algebraic extension of K and 
the signature τ of L extends δ. We want to investigate how many homomor- 
phisms B from L to R exist that extend ἃ and induce τ. This is possible by the 
classical Artin-Schreier theory of orderings and real closures. 

Assume first that L, hence also K, has trivial involution. Then ἃ maps Καὶ 
into the field Ry which is real, closed in the classical sense, and also B must 
have its image in R,. By Artin-Schreier’s theory there exists a unique homo- 
morphism β from L to Rog that extends ἃ and induces τ. 

Assume now that K, hence also L, has nontrivial involution. Let 
ἄο: Ko > Ro be the restriction of ἃ to Ky. Again we have a unique homo- 
morphism B, from Ly to R that extends ἄρ and induces the signature το on 
Lo (το = restriction of τ = signature induced on Bo(p(to)) by to). Now Lis 
the tensor product of [0 and K over Ky. Thus there exists a unique homo- 
morphism f from L to R that extends both ἃ and fy. The signature induced 
on L by β extends το and thus coincides with τ. Clearly β is the unique 
homomorphism from L to R that extends ἃ and induces T. 

Assume finally that K has trivial involution and L has nontrivial involu- 
tion. We have a unique homomorphism βο from Ly to Καὶ that extends ἃ and 
induces το. There exist precisely two homomorphisms βι and B, from Lto R 
that extend B, since Lis a quadratic extension of Ly and | R| is algebraically 
closed, and B, = Jz © B,. Both homomorphisms induce signatures on L that 
extend το. Thus these signatures both coincide with τ. 

In this way we have found the homomorphisms B from L to R that 
induce τ and extend ἃ in all cases. Composing these homomorphisms β with 
the natural map from B to L we obtain all homomorphisms from B to R that 
have kernel p(t), induce τ on B, and extend α. 

To complete the proof of our corollary it remains to be shown that any 
homomorphism β from B to R that extends « and induces τ has kernel p(t). 
Let q denote the kernel of β. Then q lies over the prime ideal p(c) of A. 
Furthermore, we obtain from β a signature on B(q) that extends τ. Thus 
q < p(t). Since also p(z) lies over p(a) the prime ideals p(t) and q must be 
equal, according to a well-known theorem of Cohen-Seidenberg about 
prime ideals of integral extensions (e.g., [1, §2, 1, Corollary 2]). 

From this Corollary 3.2, now proved, we obtain immediately 


Corollary 3.2 Let 9: A— B be a finite homomorphism between semilocal 
rings with involution, and assume that B can be generated as a module over A 
by n elements. Then an arbitrary signature on A has at most n extensions to B. 
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Indeed, just observe that in the situation of Corollary 3.2 the homomor- 
phisms from B to R that extend a correspond one to one with the homomor- 
phisms from Β @, A(p) to Καὶ that extend the homomorphism ἃ from A(p) to 
R induced by «. The algebra B ©, A(p) has rank at most n over A(p). 

If we study the extensions of a signature o on A with respect to our 
integral homomorphism g: A— B, we may pass from A and B to their 
localizations with respect to p(a). This is a consequence of the following 
localization lemma. 


Lemma 3.4 Let o be a signature of the semilocal ring with involution A, 
and let S be a multiplicative subset (= subsemigroup) of Q(a). Assume that the 
localization S~'A of A with respect to S is again semilocal. Then there exists a 
unique signature ὅ of S~' A extending o with respect to the natural map from A 
to 5. ἾΑ. We have p(G) = S~'p(c), p(@)o = S~'p(o)o, and Ο(δ) = S~ O(c). 

Proof Since the natural map from A to A(p(c)) factors through the map 
from A to S~'A, there exist extensions of ¢ to S~!A. Let τ be one of them. 
Clearly, 5. ΓΓ(σ) is contained in Γ(τ), hence S~'Q(c) is contained in Q(c). 
Let u be a unit of the fixed ring S~'Ag of the involution of S~1A. The set 
δ᾽, ΤΑ is the disjoint union of the prime ideal S~'p(c)o and the sets 
+S~*Q(c). Thus u lies in one of the sets +5~'Q(c). According to Property 
ΡΟ, applied to τ, we have t(u) = +1 ifu lies in S~'Q(o), and t(u) = —1 ifu 
lies in —S~'Q(c). Since τ is determined by the values on the hermitian 
spaces of rank one [Knebusch et al. 10, Proposition 2.16], we see that τ is the 
only signature of S~ 1A extending σ. 

We have a natural map from 5. ‘A to A(p(c)) that identifies the field 
A(p(c)) with the residue class field of S~'A with respect to the prime ideal 
S~'p(c). The restriction of the signature δ᾽ on A(p(c)) to S~'A is an exten- 
sion of o and thus coincides with τ. This implies that S~ 'p(c) is contained in 
p(t).. We now know that S~ 'p(o)o is contained in S~'p(t)) and from above 
that S~'Q(c) is contained in Q(t). Since S~ ‘Ag is the disjoint union of the 
sets S~'p(o)o, - 5. ᾽Ο(σ), and also the disjoint union of the sets p(t), 
+Q(t), we have 


S~*p(a)o = P(t)o » S~*Q(c) = Q(t). 


It remains to be shown that S~ 'p(c) coincides with p(r). Let x be an element 
of S~*A that does not lie in S~*p(a). Then x = 5΄ ἦγ with s in S and γ in A 
but y not in p(c). According to Property P11, the norm N(y) lies in Q(o), 
hence N(x) lies in S~'Q(c), which coincides with Q(t). Thus, again by P11, x 
does not lie in p(t), and we see that the subset S~'p(c) of p(t) actually 
coincides with p(t). This finishes the proof of Lemma 3.4. 


Proposition 3.5 (i) Let g: AB be a finite homomorphism between 
semilocal rings with involution. Let o be a signature on A and let p denote the 
associated prime ideal ρ(σ). Then o has a unique extension ὅ to A,, and every 
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extension τ of o to B has a unique extension T to B, . Thus we have a one-to-one 
correspondence between the extensions of o to B and the extensions of ὅ to B,. 
(ii) The set Q(z) consists of the elements uv~* with u in Q(t) and v in 
Q(c), and p() is the localization p(t)B, of p(t). 
(iii) Assume that in addition φ is a Frobenius extension and that s is a 
Frobenius form on B over A. Let 5: B, > A, denote the induced Frobenius form 
on B, over A,. Then n(t, s) = n(t, 3) for every extension τ of σ to B. 


Proof Parts (i) and (ii) of the proposition follow from the preceding 
lemma, applied to both A and Band the multiplicative set δ = Q(c). Indeed, 
by Property P1l we have S-'A =A, and S”-'B=B.,. Notice that S is 
contained in Q(t) for every extension τ of o to B. 

Assume now that 5: B— A is a Frobenius form over A. For y in W(A), 
we denote the image in W(A,) by Κ and for z in W(B) the image in W(B,) by 
z. We have 


as*(z) = G(s*(z))~ = &(8*(2)) = Σ᾽ n(%, 5)z(Z) = Σ᾽ πᾷ, 3)t(z). 


τ|σ τ]σ 
Thus indeed π(ζ, 5) = πίτ, 5) for all signatures t on B extending σ. 


Example 3.6 Assume in the situation of part (iii) of the preceding pro- 
position that B, is finite etale over A,. Then πίτ, 5) # 0 for every signature τ 
on B extending σ. More precisely, |n(t, s)| = 2 if A(p(c)) has trivial involu- 
tion and B(p(t)) has nontrivial involution, and |n(z, s)| = 1 otherwise. 

This follows from our Proposition 3.5 using (1.5) and the determination 
of the multiplicities n(z) for finite etale extensions of semilocal rings in 
[7, II, Proposition 8.5]. (In [7] it is assumed that A and B are connected. 
This restriction can be removed easily in our situation.) 


Our theory of prime ideals associated with signatures on semilocal rings 
and the consequences drawn from this theory in the present section im- 
mediately generalize to “weakly semilocal rings” as considered in [7]. I call 
a ring A with involution weakly semilocal if A contains a semilocal ring A’ 
(stable under the involution) such that A is integral over A’. Then A is the 
limit of an inductive system of semilocal rings with involution with finite 
transition morphisms. Weakly semilocal rings with involution are a more 
natural category for the present section than semilocal rings with involution 
since an integral extension of a weakly semilocal ring is again weakly 
semilocal. 


4. Frobenius Extensions with One Generator 


Let A be a commutative ring with involution. 


Proposition 4.1 Assume B is a Frobenius extension of A generated by one 
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element 3; B = Α[8]. Let σ be a signature on A and « a homomorphism from B 
to a real closed field with involution R inducing o. Then for any extension τ of o 
to B and any.Frobenius form s on B over A, we have 


[πίτ, s)| “ πίτ, a) 


with n(t, α) denoting the number of homomorphisms B from B to R that extend 
a and induce t. 


This is an improvement of a previous inequality (1.4). To prove the 
proposition we run through the proof of Theorem 1.1, and we see that it 
suffices to show that for the Frobenius extensions B; of R (1 <i <r) occur- 
ring there we have ps*<15 =O or = +1. Now every B; is generated over R 
by one element. Thus our proposition is evident if we verify the following 
lemma. 


Lemma 4.2 Let B be a finite extension of a real closed field with involu- 
tion R, and assume that B is generated over R by one element. Then B is a 
Frobenius extension of R. For any Frobenius form s on B we have s*(1) = +1 
if |B: R] is odd and s*(1) = 0 if [B: R] is even. 

Proofp B=R@®N with N the nil radical of B. We assume Ν # 0. Let 9 
be a generator of B over R. We have 9 = c + μ with c in Κα, u in N, and u is 
again a generator of B. Let n denote the smallest natural number with μ = 0 
(n 2 2). Then 1, u, ..., u"~' is a basis of B over the field R. Clearly 


i= u(A + v) 


with A in R, A #0, and v in N. Denoting the unit J + v by « we have εξ = 1 
since u =u. There exists an element p in R such that » + Af #0. Then 
B:= + eff is a unit of B, and B = εβ. (This is the classical procedure for 
solving the equation ¢ = BB~'.) Introducing the element t := Bu, we have 
t =t,and 1,t,...,t"~' is again a basis of B over R. We introduce the R-linear 
form s on B with 


5“() -- --- = s(t" 2) =0, s(t" ἢ - 1. 


According to Proposition 2.5 this form is a Frobenius form and s*(1) = 1 ifn 
is odd, s*(1) = 0 if n is even. If s’ is another Frobenius form on B, then 
s'*(1) = +s*(1) (cf. end of Section 1). This finishes the proof of the lemma. 


As a relevant example we consider an extension 
B= A[T\/f(T) 


with f(T) a normed polynomial in A,|T] of degree n 2 2. Let t denote the 
image of T in B. Then 1, t,..., t" ἢ is a free basis of the A-module B. We 
extend the involution J, to B prescribing t = t. We consider the A-linear 
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form s on B defined by 
s(1) = s(t) =---=s(t""7)=0, s(t? *) = 1. 


The hermitian matrix of the hermitian form s(Xy) with respect to our basis 1, 
t,..., t"' has the shape 


1 


and thus is nonsingular, hence s is a Frobenius form. 

We now assume in addition that A is semilocal. Let σ be a signature on 
A, and let p respectively p,. denote the prime ideal of A, resp. 40. associated 
with o. Let further K denote the residue class field A(p), and as always let Ko 
denote the fixed field of the involution J, , which coincides with Ap(po). The 
image f(T) of our polynomial f(T) € Ao[T] in Κι [7] has a decomposition 


7(T) = ΠΡ)» 


with pairwise different normed irreducible polynomials p,(T) over Ky. Let τ 
be a signature on B extending o. Then p(t), is a prime ideal of By lying over 
Po and thus 


P(T)o = Po Bo + pPT)Bo 


with j uniquely determined by τ, 1 <j <s, and p,(T) a preimage of p(T) in 
A,[ T]. Since p(t) is the unique prime ideal of B lying over p(t),, the polyno- 
mial p(T) remains irreducible over K, and 


p(t) = ΡΒ + p,T)B. 
We call p,(T)* the factor of f(T) corresponding to τ. 


Proposition 4.3 Let p,(T)* be the factor of f(T) corresponding to τ. Then 
n(t, s)= +1 ife; is odd and πίτ, 5) = 0 if e; is even. 

Proof Wechoose a homomorphism ἃ from A to a real, closed field with 
involution R that has kernel p(c) and induces o. There exist either one 
homomorphism f or two homomorphisms β, β' from B to R which extend «, 
and in the second case β' = Jp © β (cf. Corollary 3.2). Let A denote the image 
of t under f in both cases. Then ἢ = A, and / is a zero of the polynomial pT] 
obtained from pT) by applying « to the coefficients since β factors through 
B(p(z)). Going through the proof of Theorem 1.1, we see that with the 
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notations introduced there 
B(t) = R[T]/(T — 4)”. 


Our proposition now follows from Lemma 4.2 or already from Proposition 
2. 


It would be more difficult to determine the sign of πίτ, s) if e; were odd. We 
do not enter into this. 
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Generalizations of Gauss’s Lemma 


EMMA LEHMER 


BERKELEY, CALIFORNIA 


The original Gauss lemma is generalized in various ways: First, by replacing the integers 1, 2,..., 
(p — 1)/2 by any subset a; of (p — 1)/2 distinct integers less than a prime p such that a; # p — a; 
and having any desired property enjoyed by half the integers < p; next by replacing the integers 
less than p by any symmetric (modulo p) coset of kth power residues of p; and finally by replacing 
the prime p by a composite modulus m and the integers less than p by the totatives of m. 

Application is made to criteria for the kth power character of 2, to the quartic reciprocity law, 
and to the character of a permutation. 


Every textbook in elementary number theory contains a simple and 
elegant result known as Gauss’s lemma. It will be found in a little more 
general form in Hasse [3] and for our purposes can be stated as follows. 


Theorem 1 Let p= 2n+ 1 be a prime. Let S = {1, 2,..., p — 1}, and let 
δι and δ, be two half-sets of S defined as follows: 


S, = {a\, a,,..., a)} with a;#p—aj, a;#a; 
and 
δι) = {075-05 3 ἐς: 4] where αἵ = Aa; (mod p). 
If S, contains μι elements that are not in S,, then the Legendre symbol 
(A/p) = (— 1). 
187 
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Corollary 1 Let a; = ifori = 1,2,...,n, then μι. is the number of elements 
in 5. that exceed n. This is the original Gauss lemma. 


Corollary 2 Let a; = 2i— 1 fori = 1, 2,..., n, then p, is the number of 
even elements in S,. This special case will be found in Shanks [7]. 


We next generalize Gauss’s lemma a little further as follows: 


Theorem 2 Let P be any property enjoyed by half the elements a, but not 
by p — a; of S = {1, 2, ..., p — 1} where p is an odd prime, and let μι and μι be 
respectively the number of elements in S, and S, of Theorem 1 that do not 
possess property P, then 


(Δ) = (-- 1 τὴ 
If property P is enjoyed by all the elements of S,, then μι = 0 and μ; is 
the number of elements in δ that are not in S,, which is Theorem 1. 
Before proving Theorem 2 and hence Theorem 1 as well, we first genera- 
lize it to subsets of the first p — 1 integers as follows: 


Theorem 3 Let p= 2kn + 1 be a prime, let a, < a, < °°: < a, < pand 
let 


S = {8}; ἂχ, ..., Az,} with Qan+1-i — Ρ — GQ; 


be any coset of the kth power residues of p and let 4 be a kth power residue of p. 
Let P be a property enjoyed by half the elements a; of S, but not by p — a;. Let 


S, = {a}, a2,..., a,} with a;#p—aj, αἱ Ξ αἱ 
and 
S, = {a}, ay, ..., ay where a; = Aa; (mod p) 


be two half-sets of S, having respectively μι and μι; elements not possessing 
property P, then 


‘A 
= if k is odd, 
ε- 


λ 
[ if k is even. 
P} 2k 
Proof Since a; + p — aj, it follows that a7 ΞΕ p — aj (mod p). If either a; 


or a; does not possess property P, then its complement modulo p must 
possess property P, but there are μι such elements in S,, and yp, in S,. 
Therefore, the product of all the elements in S, 


[lai =4"[ [a = 4"(— 1)" = (-- 1)π (mod p) 
i=t Ξ1 


t 
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where z is the product of all the elements in δ, having property P. Since p 
does not divide zt, we have 


(-- 1)γ1 1} μὰ = Ans Jedi = ( (mod p). 
P] 2k 


Since A is a kth power residue, it will be a 2kth power residue for k odd if 
it is a quadratic residue. Hence the theorem follows. If k = 1 this becomes 
Theorem 2. 


If all the elements of S, satisfy property P, then Theorem 3 is a generali- 
zation of Theorem 1 with k = 1. 

We next give some applications of Theorem 3 in case 2 is a kth power 
residue. 


Theorem 4 Let p = 2kn + 1 and suppose that 2 is a kth power residue of 
p, then the parity of the number of elements exceeding p/2 in a half-coset 
S, = {a}, az, ..., a} with αἱ # p— a; 


of any coset of kth power residue modulo p is the same as the parity of the 
number of odd elements in the half coset if and only if 2 is a 2kth power residue 
of p. : 

Proof Let P be the property that an element is less than p/2 and let P’ 
be the property that an element is even. Let μι and μ; and, correspondingly, 
μί and p’, be the number of elements that do not possess property P and P’ 
in 5, and S,, respectively. Then by Theorem 3 with A = 2 we have 


(2/p)>, = (— 1417" = (- 1 ert μ2' 
Or 
μι + M2 = Hi + He (mod 2). 


But every element which exceeds p/2 in S, generates an odd element in 
S,, therefore μι = μΏ and hence p, = μί (mod 2) so that 


(2/p) when_k is odd 


(-- 1)" + py’ 
(2/p) 2x when_k is even, 
which is the theorem. 


Corollary If k is odd, and if 2 is a kth power residue, then 
μι = μή (mod 2) if and only if p = +1 (mod 8). 


This corollary was first proposed as a problem for k = 1, for the set 
δ. 5 (σ΄, "ἢ, .... 95 
where g is a primitive root of p = 2nk + 1. 
Example 1 Let k = 3, p = 31, g = 3, αἱ = 3*' (i= 1, 2, 3, 4, 5). 
S, = {27, 16, 29, 8, 30}, μιΞΞ 4, μι =2. 
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As predicted by the corollary the p’s have the same parity. 


Theorem 5 Let p = 2nk + 1, let v, be the number of elements < p/4 in the 
half-set S, of any coset of kth power residues, where 


S, = {a\, az,..., a,} with a; < p/2. 
If 2 is a kth power residue, then 


(—1y™ = (2/p) if Κα is odd 
~~ \(2/p) 2, if k is even. 
Proof Using Theorem 3 with 2=2 we note that for every 
a; = 2a; (mod p) which exceeds p/2 there is an αἱ that exceeds p/4, therefore 
Vv, = nN — μ, and since μι = 0, the theorem follows. 


Corollary If k is odd, the number of kth power residues less than p/4 is of 
the same parity as (p — 1)/2k if and only if p = +1 (mod 8). 

For k = 1, the theorem is trivial. For k = 3, p = 31, we have a; = 1, 2, 4, 
8, 15, so that v3 = 3, 30/6 = 5. 

For k = 2, there are many expressions for the quartic character of 2 
beginning with Gauss in terms of p = a? + 4b’, Barrucand and Cohn [1] in 
terms of c? + 8d? and the class number h = h(./—4p), and by Hasse [4] in 
terms of h’ = h(,/—8p). Since 2 must be a quadratic residue, p = 1 (mod 8), 
letting n = 2π' in Theorem 5, and we have 


(-- 1)" ΞΞ (* = (-- 1}} = (-- {γῇ es (—1)"th/4 ἘΞ (—1)""*. 
4 

For example, for p = 73, there are v, = 10 quadratic residues less than 
p/4,n’ =9,b=4,d=3,h=4, μ' = 16, all testifying to the fact that 2 is a 
quartic residue of 73. 

Going one step further, we find that for k = 4, the quartic residues of 73, 
less than p/4, are 1, 2, 4, 8, 16, so that v, = 5,n = 9, and hence 2 is an octic 
residue of 73 by Theorem 5. It is also well known that (2/p)g = (—1)"*"/* so 
that in general v, = b/4 (mod 2). 

Another application of Theorem 3 is as follows: 


Theorem 6 The multiplication of the elements of any coset S of kth power 
residues of a prime p = 2nk + 1 by a kth power residue 4 produces an even 
permutation if and only if 


[} if k is odd 


if k is even. 
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Proof Since p = 2kn + 1, the set S is symmetric modulo p. Let S, con- 
tain the elements a; < p/2 and δ΄, contain the elements p — a; fori = 1,2,..., 
n. By Theorem 3, multiplication of a; Ε S, by λ will contain μι. elements that 
exceed p/2, where 


1) if k is odd 
(—1)"= 


(‘) if k is even. 
P/ 2k 


If these x, elements of δ᾽; are interchanged with their complements in the set 
δ... obtained by multiplying the set δὴ by 4 (mod p), the resulting set will 
consist of n elements < p/2, followed by their complements modulo p. If 
each half is now subjected to the same interchanges, the parity of the permu- 
tation will not change so that the permutation is of the same parity as p,, 
which proves the theorem. 


Corollary The multiplication by λ of the integers 1,2,..., p — 1 produces 
an even or odd permutation according as A is a quadratic residue or not. 


This is Zolotareff’s theorem [8] which follows from Theorem 5 with 
k = 1, or directly from Theorem 1. 

Theorem 6 is applicable to all primes p = 1 (mod k) if k is odd, but only 
to those p = 1 (mod 2k) if k is even. However, in the remaining cases k even 
and p=k + 1 (mod 2k) the permutation can be shown to be always even by 
another method. 

Gauss made good use of his lemma in his third proof of the law of 
quadratic reciprocity. If we use Theorem 3 with A = A(p) = q and then again 
with p replaced by q and with A(q) = p, we obtain the following theorem: 


Theorem 7 Let p = 2nk + 1, q = 2km + 1, and let (p/q), = (q/p), = 1, 
let u,(p) be the number of elements exceeding p/2 among the multiples of the 
first half of the kth power residues of p by q, while μρ(4) is the corresponding 
number of multiples of p by the first half of the kth power residues of q that 
exceed q/2 modulo 4, then 


(— 1)γμκῳ Hato) — bias =(-1)? "4-04 ΕΓ k is odd 
(p/q)2x(q/P) 2x if k is even. 
Corollary If k is odd, then 


1+4y,(q)(mod 2) if p=q=-—1 (mod 4) 


Hal?) = ε' (mod 2) otherwise. 


When k = 1, this is the reciprocity theorem. When k = 2, p = 4n + 1 weare 
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dealing with multiples of quadratic residues (or nonresidues) of p and 4 such 
that (p/q) = (q/p) = 1 and obtain 


(2) (2). = (— 1) 00+ Hal) 


Other criteria for the quartic reciprocity law are known, such as Burde 
[2] and Lehmer [6] in terms of p=a*+b?, g=c?+d? and of 
p = C? + qD’, when such a representation exists. These are 


[2 [2 -ὐ|5::5}-ἰ if q=8n+1, p=C*+qD’ 
GQ) a\P} 4 p } \(-1P if q=8n+5 


so in this case 


g=8n+1, p=C*+4+qD? 
= d 2 
μι) = gla) (mod 2) [1 POE ta 


OT 


Example =k = 2, p = 41, q = 37,a=5,b=4,c = 1,d = 6, ἢ = 1, (p/q) 
4(q/P)4 = —1 since the quadratic residues modulo 41 < p/2 are 1, 2, 4, 8, 9, 
10, 16, 18, 20 multiplication by 37 (mod 41) gives 37, 33, 25, 21, 9, 5, 1, 18, 10, 
2 of which the first four exceed p/2, hence μ5γ(41) = 4, while the quadratic 
residues of 37 « 42 are 1, 3, 4, 7, 9, 10, 11, 12, 16 multiplication by 
41 (mod 37) gives 4, 12, 16, 28, 36, 3, 7, 11, 27 (mod 37) with only three 
numbers exceeding 18, hence μ,,(37) = 3 and the result follows. 


In conclusion, we give a generalization to a composite modulus m as 
follows: 


Theorem 8 Let m> 4 and let n= $(m)/2. Let 
S = {m,, mz, ..., M2,}, m,<m and (m,,m)=1 


be the set of totatives of m. Let P be a property enjoyed by half the elements m,; 
of 5, but not by p — m;. And let 


S, = {m\, m,,..., m,} with m; +m, #0 (mod m) 
and 
S,={m{,mz,...,m,} where mj; = Am; (mod m) 


be two half-sets of totatives of m having respectively μι and μι elements which 
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do not possess property P, then 


(-- 1) 11 μὰ = (A/p) y m=p or 2p 
| 1 otherwise. 


Proof Let M be the product of all the totatives of m having property P, 
and let M, and M, respectively be the products of all the elements in S, and 
S,. Since either m; or m — m; is an element of S, and similarly either m/ or 
m — m; is an element of S ,, it follows that M, = (—1)*'M (mod m) and that 


M,=(-1)#M = A"(—1)*!M (mod m). 
Since (M, m) = 1, we obtain 
(—1)1* μὰ = 4" (mod m). 
But it is known that for m > 4, 


Jn -- Jomy2 — (4/p) if m= p* or 2p* 
1 otherwise, 


so that the theorem follows. 


Theorem 8 reduces to Theorem 3 with k = 11ifm = p and gives a genera- 
lization of Zolotareff’s theorem mentioned in the corollary to Theorem 6 to 
a composite modulus as follows: 


Theorem 9 The multiplication of the totatives of m> 4 by a number A 
prime to πὶ produces an even permutation if m # p* or 2p*. In case m = p* or 
2p*, the permutation is even or odd according as (A/p) = 1 or —1. 

Proof Let S, contain the totatives m; < m/2 of m and S', those that 
exceed m/2 and let property P be that m; < m/2, then μι = 0 in Theorem 8 
and μι is the number of elements in δ᾽ that exceed m/2. If these elements are 
interchanged with the corresponding elements in δ΄, obtained by multiplying 
δῖ by A, the resulting half-sets will contain elements m/ < m/2 followed by 
m — m; > m/2. The permutation can now be completed by making corre- 
sponding interchanges in both sets, so that the parity of the permutation is 
exactly μ and the theorem follows from Theorem 8. 

For example, for m= 18, n=3, A=5, S={1, 5, 7, 11, 13, 17}, 
δι = {1, 5, 7}, δ᾽, = {11, 13, 17}, δ, = {5, 7, 17}, S5 = {1, 11, 13}, and μ, = 0, 
με = 1, so that there is an odd number of interchanges. In fact, we inter- 
change 1 and 17, then 1 and 5 and 13 and 17, and finally 5 and 7 and 11 and 
13, so there are actually five interchanges and since m = 2 - 37, p = 3 and 
(A/p) = (5/3) = -1. 

To avoid confusion, it should be noted that Hasse’s [5] generalization of 
Gauss’s lemma to composite modulus finds the parity of the number N of 
solutions (x, y) of the inequality 4x — my < 0 with x < m/2 and prime to m 


194 EMMA LEHMER 


and y < 4/2 for λ ἃ prime. It turns out that N = μι (mod 2) unless p = 2p*in 
which case N is always even, while μι is odd in case (A/p) = —1 as in the 
above example. On the other hand, for A = 5, m = 18 we have the solutions 
(1, 1), (1, 2), (5, 2), and (7, 2), giving N = 4. 
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Let p be a prime of the form 4k + 3 and let s, t be nonnegative integers. It is shown that any 
solution to |a|? + |B |? = p' in the algebraic integers of the cyclotomic field Q(e?™!”) must have a 
or B equal to zero. 


Let p be a prime of the form 4k + 3. It 1s well known from elementary 
number theory that the diophantine equation 


a? + b? = p*! 


has only the trivial solutions with a or b equal to zero. In this paper we 
extend this result to the algebraic integers in the cyclotomic field Q(e?*/?*) 
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obtained by adjoining a primitive p’th root of unity to the field Q of ra- 
tionals. We prove 


Theorem Let p be a prime of the form 4k + 3 and let s, t be nonnegative 
integers. If « and B are algebraic integers in Q(e?™/”*) that satisfy 


Jal? + |Bl? =p’, (1) 
then either « or B is zero. More generally, the same conclusion holds if 
Ja|* + |B? = [ρ|, (2) 


where p is an algebraic integer in Q(e?™!”*) that divides some power of p. 

Proof Let Z denote the ring of rational integers and let p be a prime of 
the form 4k +3. If αὖ +b? =0 (mod p) with a,be Z, then a=b= 
0 (mod p); for a # 0 (mod p) would imply that (b/a)? = —1 (mod p), which 
is impossible. The diophantine equation a? + b? = p' has no solution if t is 
odd by a mod 4 argument; and if t is even, an easy induction on t shows that 
either a or b is zero. This proves the theorem for s = 0. 

Now suppose that s is a positive integer. Let € = e?"/?". Then Z[C] is the 
ring of algebraic integers of Q(£) and Z[¢ + ζ7 is the ring of algebraic integers 
of Q(¢ + €), the maximal real subfield of Q(¢). Furthermore, {1, ζ) is an 
integral basis of Q(¢) over Q(¢ + ζ). Let a, B © Ζ[ζ] satisfy (2), where 
p € Z[C] is a divisor of some positive integral power of p. For the moment, 
assume that p is not a unit and that the t occurring in (1) is not zero. Note 
that any positive integral power of p can be expressed in the form pp since 


p={[|, (1 — ©), where j ranges over a reduced residue system modulo p*. 
Write 


a=a,+a,l, B=b,+5b,6, (3) 


where a,, az, b,, b, € Ζ[ζ + ζ]. The principal ideal generated by p factors in 
Ζ[ζ] as (p) = (1 — 9, where @ denotes the value of the Euler ¢-function at 
p*. Thus p = e(1 — CY for some positive integer r and a unit ¢ ε Ζ[ζ]. Hence 


|p|? = sta’, (4) 
where 
ω -Ξ (1 -- Πγ( -- ἢ Ξ 2 -- ξ τ ἢ (5) 
Then (2}-.(5) yield 
aa + BB = αἰ + aj + a,a,(2 -- ΟἹ + bj + b3 +b, b,(2 — aw) 
= (a, + a,)* + (δ, +b)? — (a, a, +b, b,)o 


= δδω. 
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Therefore, in Ζ[ζ + ζ] 
(a, + 43)? + (by + by)? =0 (mod 0») 


The residue class ring Z[¢ + []/(w) is a finite field whose order is the norm of 
w, that is, p. As noted previously, the equation a” + b? = 0 has only the 
trivial solution a=b=0 in the field of order p =3 (mod 4). Hence 
a, + a, = 0 (mod o), say a; + a, = nw with ἡ € Z[¢ + ζ]. Thus 


a= a, +a,6=a, + (nw — α()ζ =a,(1 —C) + nol. 


Therefore, 1 — ζ divides «. Similarly, 1 — ¢ divides B. Write « = a,(1 — ζ), 
B = B,(1 — 6), where «,, B, belong to Ζ[ζ] and satisfy 


Ja, |? + |B, |? = ceo" *. 


Proceed by induction on r to obtain 


[α, [5 + |B,|? = εξ. (6) 
Let a, = a,/€, Bo = β, 8 so that 
Jeol? + |Bol* = 1, (7) 


with a, Bo € Ζ[ζ]. Observe that (1) and (2) are already of the form (7) and 
(6), respectively, if t = 0 or p is a unit. The Galois group of Q(¢) over ὦ is 
abelian, so in particular complex conjugation commutes with algebraic 
conjugation. Therefore (7) implies that all conjugates of a) do not exceed 
one in absolute value. Hence by a theorem of Kronecker, & 1s zero or a root 
of unity; and similarly for βο. Whence by (7), either ἀρ or Bo is zero. Thus « 
or B must be zero. This completes the proof. 


The theorem is of course false for primes of the form 4k + 1, for such 
primes can be expressed as the sum of two squares in Z. Suppose p = 2. It is 
not difficult to show that 


Jaf? + [BPP = 2 (8) 


has only trivial integral solutions in Z(e”™/?*) with « or B zero if and only if t 
is even and 5 = 0 or 1. Indeed, (8) has a solution with a = β in Z or Z[i] 
according as t is odd or even. 
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1. Introduction 


In his dissertation, E. Artin proved [1] that there are only finitely many 
quadratic extensions E/k(x) for which the ideal class group of the integral 
closure of k[x]| has exponent 2 if k is a prime field of odd characteristic and 
the infinite prime does not split. The inequalities from which Artin drew the 
conclusion about the finiteness also give upper bounds on the genera of such 
fields. Ὁ. Madden [4] gave a substantial improvement of these upper 
bounds. He also included the case of characteristic 2 and allowed k to be any 
finite field. In Section 3 we give a further improvement of Artin’s theorem. 
The null class group has exponent 2 for such fields 2. Using the class number 
formula for the ambiguous classes and counting the number of integral 
divisors in a divisor class of sufficiently large degree, an application of the 
Riemann hypothesis gives a bound on the genus. If there is no ramified 
prime of degree one, these bounds are better than those obtained by 
Madden. 


199 


200 MANOHAR L. MADAN AND SAT PAL 


In Section 2 we evaluate the Galois cohomology groups for the null class 
group and the divisor class group associated with a cyclic extension E/F of 
prime degree |, F being an algebraic function field over a finite field of 
constants. The formula giving the number of ambiguous classes plays a key 
role in this evaluation. Knowing the ramification, one can calculate the 
ambiguous class number from this formula, except in the case when the [ἢ 
roots of unity are contained in the field of constants and the degrees of all the 
ramified primes are divisible by /. In this case the least positive degree of an 
invariant divisor class is 1 or /. Both the possibilities are realized. This 
answers affirmatively a question of Rosen [5]. 


2. Galois Cohomology 


Let E/k be a cyclic extension of prime degree ἰ of an algebraic function 
field F/k having the finite field k as its exact field of constants. We use the 
following notation: 


Dre(Dog) the group of divisors (of degree 0) of E 
Cr(Cog) the group of divisor classes (of degree 0) of E 
[ε{10ε}] — the group of idéles (of degree 0) of E 

Je(Joz) the group of idéle classes (of degree 0) of E 


Pr the group of principal divisors of E 

t the number of ramified primes 

OE /F) the gcd of | and the degrees of the ramified primes 

ζ the minimal positive degree of an invariant divisor class 
G = gal(E/F) 


h'(G, Μὴ the order of the cohomology group H'(G, M). 


We use the same symbol to denote a field and its multiplicative group. 
Finally, for a G-module M, M® will denote the submodule of invariant 
elements. 

We evaluate the cohomology groups for the various G-modules that are 
associated with the extension E/F. In particular, we prove 


Theorem 1 (a) If E/F is unramified, 


h°(G, Cog) = h*(G, Cor) = h°(G, Cz) 


=[h'(G,C;)=l? τ k contains the /th 
roots of unity; 


=] if k does not contain 
the /th roots of unity. 
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(b) If E/F is ramified, 


h°(G, Cog) = h'(G, Cog) = 6(E/F)f if k does not contain the 
Ith roots of unity; 


= ff! if k contains the Ith — 
roots of unity and 
O(E/F) = 1; 


= cI if k contains the [ἢ 
roots of unity and 
O(E/F) = 1; 


h°(G, Cz) = lh'(G, Cz) =F if k contains the /th roots of 
unity and 6(E/F) = 1; 


= ft? otherwise. 


Proof Since G is cyclic, H'(G, M) = H'**(G, ΜῈ) Further, if the module 
is finite, by Herbrand’s lemma, h'(G, M) = h'*'(G, M). Thus, 


h*(G, Coz) = h°(G, Cor) = [Cr: Norm(Cos)] 
= (Cor: Cor][Cor: Norm(Cos)], (1) 


Cor denoting the image of Co, under the canonical conorm map Co; > Cog 
induced by the inclusion F c E. 
We have the ambiguous class number formula [6] 


Re = [C&q: 1] =e ΤπΠὶ (2) 


where hy = [Cor: 1] is the class number of F; a the minimal positive degree 
in E of a divisor of F; ἢ the elements in the multiplicative group of k that are 
norms from E; ἡ the elements of k that are norms of units of E; and ε the 
kernel of the norm map restricted to the units. 

We also know [2] 


h 
[Cor: 1} = εἰ if E/F is unramified and k 
contains the ἢ roots of unity; 
he otherwise. (3) 


From (2) and (3), we can calculate [Cé,: Cor]. 
To evaluate the second factor on the right-hand side in (1), consider the 
canonical exact sequence of G-modules 


lo Ε-»]] -οῦὐ,--]. 
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In cohomology, it gives the exact sequence 
1+ E°= FoI1f=J?—>H'(G, E). 


The group H'(G, E) is trivial by Hilbert’s Theorem 90. Therefore, every 
invariant idéle class contains an invariant idéle. Also, it follows from 
definition that 


Ig = Tp, [6 = Tor. 
Thus, 
JE=SJr, Joe = Jor- 
Consider the canonical map 
Jor Cor- 
Restricted to J§,, it gives the epimorphism 
Joe = Jor > Cor: 
Norms correspond to norms. Therefore, 
h°(G, Jor) = [Jée: norms] = [Cor: norm(Co,)]. (4) 


Considering the rational integers Z as a G-module under trivial action, 
we have the exact sequence of G-modules 


(Si ξυῦ». Es. 730 (5) 


We are using here F. K. Schmidt’s theorem [7] which states that for finite 
k the algebraic function field has a divisor of degree one. From (5), we 
obtain, in cohomology, the exact sequence 


p 
H~1(G, Z) > H°(G, Jor) > H°(G, Jz) > H°(G, Z) > H*(G, Jos) 


— H'(G, Jz). (6) 


From class field theory, H'(G, J,) is trivial and H°(G, J,) has order I. 
Also, 


crossed homomorphisms 
PACD <W116 7) 2 eee 

principal crossed homomorphisms 
invariant elements = Z 


0 Z ed ee lt 
HG, 2) norms IZ 


Idéle classes of F have their degrees multiplied by ἰ when considered as 
idéle classes of E. Therefore, the map p in (6) is the zero map. Hence, 


h°(G, Jor) = H°(G, Jz) = h'(G, Jor) = |. (7) 
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From (1), (4), (7), (2), and (3), we have 
h°(G, Cor) = I[C&r: Cor] 


= [?(hy/h,p) if E/F is unramified and k 
contains the ἢ roots of 
unity; 

= I(hg/hr) otherwise. (8) 


As mentioned above, the degree of a divisor of F gets multiplied by /| 
when considered as a divisor of E. Therefore, by F. K. Schmidt’s theorem, 
a = | in (2). From (2) and (8), we have 

h°(G, Cor) = εἴη: η] if E/F is unramified and Κα 
contains the [ἢ roots 
of unity; 


_ ela: 1] 
[e: 1] 


We separate, now, the ramified and the unramified cases. 


otherwise. (9) 


E/F Unramified Let E/F be the extension obtained by making a con- 
stant extension of degree ἰ. Let Γ = gal(E/E) = gal(F/F). As in [2], consider 
the exact sequence of T-modules, induced by the inclusion F c E, 


1-.Ν- Ορε-- Cor 1. 
In cohomology, it gives 
1-. Nt > Cor > (Cor) > A(T, N) > ΗἾ(Γ, Cop). 
If k contains the /th roots of unity, 
l=(N)' =N, Cons Con: μ᾽ (Γ, Cor) = 1, Wr, ΝῚὴ ΞΞ Ι. 


Therefore, hy = [((05}}: 1]. If k does not contain the ἢ roots of unity, 
then N = 1 and the last equation is still valid. 
Clearly, (Coz)' < C§,. Therefore, 


hehe. (10) 


For an unramified extension, every unit is a norm. If k does not contain 
the ἢ roots of unity, this is trivial because, then, every unit is norm of a unit. 
If k contains the /th roots of unity, this is a result of class field theory. Thus, 
in each case, 


[7: ] 
erin (11) 
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From (2), (10), and (11), 


C= pone. (12) 
Substitution in (8) or (9) gives 
h°(G, Cor) =h'(G, Coz) =? τ Καὶ contains the Ith 
roots of unity; 
=] otherwise. (13) 


To calculate h°(G, C;) and h'(G, Cg), consider the exact sequence of 
G-modules 


lo Corp > Cer a Z— 0. (14) 
In cohomology, (14) gives 


μ 
1> C§; > 6 -.Ζ-»Η (6, Coz) > H'(G, Cz) > 1. (15) 


By (12), the image under y is /Z. Therefore, (13) and (15) imply, if Κα 
contains the {ἢ roots of unity, 


h'(G, Cor) = [? -- Ih'(G, Cz). (16) 
By Herbrand’s lemma, 


h°(G, σε) h°(G, Z) _ h°(G, Coz) _ h°(G, Z) 


W(G, Cz) WG, Z) h(G, Coz) h1(G, Z) 
Therefore, (16) gives 


h°(G, Cp) = 2. (17) 
Similarly, 
h'(G, Cr) = Ls h°(G, Cr) = (18) 


if k does not contain the /th roots of unity. 
Before we study the ramified case, we evaluate H'(G, P,) in general. To 
that end, consider the exact sequence 


lo k->E-P,- 1, 


obtaining in cohomology 


λ 
Η΄ ἴ(Οσ, E)= 1-H '(G, Pz) > H°(G, k) > H°(G, ΕἸ --".... 


Therefore, h~ '(G, P;) = order of the kernel of A. If k does not contain the ἢ 
roots of unity, h°(G, k) = 1. If k contains the /th roots of unity, h°(G, k) = 1. 
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In this case a generator of k is a norm from E iff it is everywhere locally a 
norm iff the degrees of the ramified primes are all divisible by /. This follows 
from Hasse’s norm theorem. Therefore, 


h~'(G, Pg) =h'(G, Pe) =! if k contains the /th roots 
of unity and the degrees of 
the ramified primes are 
all divisible by /; 


=] otherwise. (19) 
E/F Ramified We assert that 
C = 0(E/F) 
= the minimal positive degree of an invariant divisor, (20) 


if k does not contain the /th roots of unity. For, consider the exact sequence 
1>- P,-D,;->-C;,- 1, 
and the induced cohomology sequence 
1— P§ > D§ > C$ > H'(G, P;) > --:: (21) 


(19) and (21) imply that every invariant class contains an invariant divi- 
sor. Therefore, € = 6(E/F). Substitution in (9) gives 


h°(G, Coz) = 6(E/F)f — if _k does not contain the 
Ith roots of unity; 


ΞΕ sie if k contains the Ith roots 
of unity and 6(E/F) = 1; 
= οἱ if k contains the /th roots 
of unity and 6(E/F) = l. (22) 


The orders h°(G, Cz) and h'(G, C,) are calculated, as in the unramified 
case, by considering the exact sequence (14). The values are 


h°(G, Cg) = lh'(G, Ce) =F if k contains the /th roots 
of unity and 6(E/F) = 1; 
=['*' otherwise. (23) 


From (13), (16), (17), (22), and (23), we see that proof of Theorem 1 is 
complete. 


We observe that it is only the last case in (22) that involves ¢. Rosen [5] 
asked the question if ¢ = 1 can, at all, occur in this case. An affirmative 
answer is provided by the following. 
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Proposition Let characteristic k # 2 and P(x) be an irreducible monic 
polynomial of degree 4n + 2 in k[x]. Then E = καὶ p(x)) contains an invar- 
iant class of degree one. 


Proof Consider the quadratic constant extension E. The polynomial 
p(x) decomposes as the product of two monic irreducible polynomials over 
the extended field of constants. The formula (2) for the ambiguous class 
number shows that the class number of E is odd. Since gal(E/E) operates on 
Cog and Co; is the invariant subgroup, it follows that με is also odd. Now, the 
ambiguous class number formula applied to E shows that it has an invariant 
class of degree one. 


Remark 1 Some of the arguments used in the proof of Theorem 1 also 
appear in Rosen’s paper [5]. We have repeated them here to make this paper 
self-contained. 


Remark 2 It was proved [2] that h, divides h, if E/F is a normal exten- 
sion. Rosen [5] gave another proof. We observe that the normality is an 
unnecessary restriction. Using some results from the theory of abelian var- 
ieties, one can, in fact, show that the quotient of the zeta functions is a 
polynomial with rational integral coefficients. To give an arithmetic proof 
for hy |h,, one can assume that there is no field strictly between F and E, and 
that the canonical map Cor > Cog has a nontrivial kernel. This implies that 
E/F is a pure extension of degree ἰ, a prime. The proof [2] for the divisibility 
is valid in this case. If E/F is nonnormal, a proof can also be given by 
considering a Hilbert class field H of F, 1.6., a maximal abelian unramified 
extension of F that has k as its exact field of constants. Then, H ὦ E=F 
and EH/E is abelian and unramified. A simple argument shows that k is the 
exact field of constants of HE. Therefore, by the reciprocity law, Cor is a 
subgroup of Cor. 


3. A Theorem of E. Artin 


Let k be a finite field and k(x) be the field of rational functions in one 
indeterminate. Following Artin, we call a quadratic extension E/k(x) imagin- 
ary if the infinite prime of k(x) does not split in E. Let R be the integral 
closure of k[x] in E. Then, R is a Dedekind domain with finite class group. 
There are only finitely many imaginary quadratic extensions E/k(x) for 
which the class group of R has exponent 2. We wish to obtain bounds on the 
genus of such fields. We shall assume that the genus σὲ = g > 1 and, hence 
[4], |k] Ξ 4 5.5. 

If h, is the class number οἵ R, one has the relation [7] 


πῃ, = [πε ἢ (24) 
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where f= 1 or 2 according as the infinite prime ramifies or is inert. In each 
case, Co, is a Subgroup of the class group of R. Therefore, its exponent is also 
2. (We leave out of consideration the 5 fields [3] for which h; = 1.) Now, Cog 
has exponent 2 iff it is equal to its subgroup of ambiguous classes, i.e., 


a = ἥ: ἢ 
hz =h,=t x 2! tc fot = 2, (25) 
Let m be any natural number not less than 2g — 1. Then, by the 
Riemann—Roch theorem, the dimension of a class of degree m is m — g + 1. 
The total number of integral divisors in all the classes of degree m is 


2(gr-#t* — 1g — 1). 


Also, by the Riemann hypothesis, the constant extension of degree m has, at 
least, q”/*(q”/* — 2g) primes of degree one. Considering that a prime of E of 
degree dividing m can give, at most, m primes of degree one in the constant 
extension, we see that E has, at least, ι΄ 'g™/*(q”/* — 2g) integral divisors of 
degree m. Therefore, the exponent of Co, is larger than 2 if 


(q — 1)q”7(q"? — 2g) > m2*(q™"9** — 1). (26) 


We consider the two cases when the characteristic is even and when it is 
odd. 
Case 1 characteristic = 2 In this case, (20) and (25) give 


hp = hp = 24 = εις 1 = δ(ΕΙΚ(Χ))2:  ". (27) 


The genus formula gives 2g + 2 = D, D denoting the degree of the differ- 
ent. We observe that the ramification being wild, a ramified prime of degree s 
makes a contribution of, at least, 2s to D. 

Consider the case when g = 2 and no prime of degree one ramifies. For 
g = 9, (28) gives 24 < 23. Substituting m = 2g — 1, d = 3 in (26), we verify 
that the inequality is satisfied. We omit the formal argument showing that 
(26) is satisfied for all g > 9. It is not satisfied for g = 8. 

When q = 4 and no prime of degree one ramifies, one sees, similarly, that 
(26) is always satisfied. 

If gq = 4 or 2 and a prime of degree one ramifies, Madden [4] has shown 
that g <2, g <4, respectively. These bounds are better than those derived 
from (26). 

Case 2 characteristic different from 2 Consider the case when no 
prime of degree one ramifies. By (26) and [1], 


h, = 2hp = 2! if 5(E/k(x)) 
=2'-! if 5(E/k(x)) = 


2 
L. (28) 
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Therefore, 
ἢ, -- 24 -- 2 1 if d(E/k(x)) ΞΞ 2 
= 22 if δ(ΕΙΚ(Χ)) = 1. (29) 


In this case, the ramification is tame. A ramified prime of degree 5 makes a 
contribution of, at least, s to D. Taking m = 2g — 1, one can show that (26) is 
satisfied for 4 = 3 and S if g > 5, g > 2, respectively. For example, if g = 3, 
g = 5, then D = 12. This implies t < 5. Therefore, from (29), d < 4, in each 
case. Substituting in (26) shows that it is satisfied. 

If a prime of degree one ramifies, Madden’s estimates are better. Combin- 
ing our results with those of Madden, we have 


Theorem 2 Let E/k(x) be an imaginary quadratic extension such that the 
integral closure of k|x] in E has class group of exponent 2. Then, the null class 
group of E also has exponent 2. The various possibilities are 


q=1,9,  g=1; 
q=45, 9g 82; 
q = 3, g <4; 
q= 2, g <8. 


If no prime of degree one ramifies, g = 2 is not possible for q = 4 and 5. 
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On Some Special Decimal Fractions 


K. MAHLER 


AUSTRALIAN NATIONAL UNIVERSITY 
CANBERRA, AUSTRALIA 


For a special countable set of irrational numbers, infinitely many integral multiples are constructed 
the decimal expansions of which begin with very large numbers of the digit 9. 


Real irrational numbers have the well-known property that the frac- 
tional parts of their multiples lie dense, and even are uniformly distributed, 
between 0 and 1. In a different direction I recently proved [2] the following 
result: To every positive integer n there exists a second positive integer 
P = P(n) such that, if « is any real irrational number, then there is a positive 
integer p = p(a, n) satisfying 1 < p < P(n) such that every possible sequence 
of n digits 0, 1, 2,..., 9 occurs infinitely often in the decimal expansion of pa. 

In the present note, I shall establish a result of a somewhat different kind. 
Let f(x) be a positive integral valued polynomial of degree m > 1, and let 
a(f) be the decimal fraction obtained by writing the decimal forms of f (1), 
f (2), f (3), ... successively after the decimal point. Then, for every sufficiently 
large positive integer N, among the first 10%’ digits after the decimal point of 


(10 — 1)(10? — 1) --- (10% — 1))5Ὲ1σ, 
there are at most (m+ 1)N° digits distinct from 9. 
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Almost forty years ago I had studied these numbers [1] and proved that 
they are transcendental, but are not Liouville numbers. In the present note I 
apply both notations and results of this old paper. 


We begin with an almost trivial lemma. 
Lemma_ Let 


be a convergent series where the r, are positive rational numbers, and the i, are 
strictly increasing positive integers. Denote by dy a common denominator of ro , 
r1,---, Ty, and by ey and ὃν positive integers such that 


dy > max(ro,1y,---> tn) < 10°, dy- Σ ril0-*< 107%. 


v=N+1 
If 
Ey > Ney, 
then at least ὃν — Ney of the first ὃν digits after the decimal point in the 
decimal expansion of dyo are equal to 9. 
Proof The positive integer dyro can be written as 
ἀν Υο ΞΞΞ (duro a 1) + 0.999 ry 


where the decimal fraction has only digits 9. Hence 


N fo) 
dyo = (dyro — 1) 4+ 0.999 ... — δ᾽ ἀντ, 101" — dy - Σ r, " 10°. 
v= 1 v=N+1 
Here, by the definition of e, and ¢,, each of the decimal representations 
for the integers dyr,, dyr2,..., ἀν ν Contains at most ey digits distinct from 
0, and the decimal expansion of the convergent series 


ἐν ᾿ Σ ry ᾿ 10“ 
v=N+1 
has only zero digits in the first ἐν places after the decimal point. On account 
of the term 0.999 ... the assertion follows immediately. 


2. 


The numbers o(f) of my paper [1] were defined as follows. Denote by 
f(x) a polynomial of the exact degree m > 1 which for positive integral x 
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assumes only nonnegative integral values and write 
h 


f(x) = Δ (x) = Pie {0} +h-—H) (h=0,1,2,...) 
for the successive differences of f(x). We need consider these differences only 
for 0 «ἢ <m because those with ἢ >m-+ 1 vanish identically. All these 
differences have integral values for all positive integers x, and they are 
moreover positive if x is sufficiently large. 

Without loss of generality, we impose the stronger restriction that f (x) is 
strictly increasing and positive for x > 1 and that moreover 


f(x) > 0 for O<h<mandx>1. 


It follows in particular that, for y > f(1) > 1, there exists the inverse function 
x = g(y) of y =f (x), and here also g/(y) is strictly increasing. 
A little more can be said. We can write f(x) in the explicit form 


f(x) =a ™x™(L + ayx 7 α,)χ 7 π΄ + 0,x7"), 


where ἃ is a certain positive number and a,, a, ..., ἄρ are certain real 
constants. Hence it follows that for sufficiently large y, 

g(y) = ayn + O(1). (1) 
3. 


For every positive integer k the function value f(k) can be written as a 
finite decimal 


Mx 

f(k) = Σ ἀς, 10.4.2 = ἀκοάκι °° Aye, ) 

A=0 

where the coefficients d,, are decimal digits 0, 1, ..., 9, where in particular 
dio > 0 for all k, 


and where the numbers M, are nondecreasing nonnegative integers. 
We associate now with the polynomial f(x) the infinite decimal fraction 


o(f)=0-diody, °° dim,420d21 *** dyy,430d31 50 du, "" 
By way of example, the polynomial 
(x) = x(x + 1)/2 


has the required properties, and with it is associated the decimal fraction 


σ(φ) = 0.1 3 6 10 15 21 28 36 45 55 66 78 91 105 120 136.... 
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From my old paper [1] I take a strongly convergent series for o(f). 
Denote by n the positive integer for which 


101. 1 <f(1) < 10"—-1, 
and put 
κι =9 and j, =[g(10" -- 1)] for v=nn+1,n+2,...; 


further write 


= Sai, oat 1) = (ν — 1)j,- 17 pe 


for v=n+1,n+2,n+3,.... 
With this notation 


o(f)= Dm f(kylo- ΟΣ το. δηλ Σ f (k)10-*. 


v=n+1 k=jy-itl 


Here the finite sums can be summed by means of a formula from difference 
calculus, giving the formula 


o(f)= Y fll) )(10" — 1) (h+ 1) 


-ῸΣ 10S finn + IMO! — ἡτότυ — C40" — 1) 


(2) 


On putting 


ro = Σ A(1)(1o"— 17", 


h=0 


n= assay 10 ye) 
h=0 
for v>on+l, 
Peed <x for v>1, 


the formula (2) can be written as 


o(f)=ro- Σ rlo-' (3) 
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a series of the same form as in the lemma. From their definitions, all the 
numbers ro, ἢ.» r2,... are positive and rational. The coefficients f,(1) and 
fi(jy-1 + 1) are positive integers. If further dy denotes the product 


dy = ((10° — 1)(10° — 1) --- (LOY — 1)", (4) 


then dy is a common denominator of the N + 1 rational numbers ro, 1r;,..., 
ry, and 


dy < (102+24~+ Ayn +1 — 100πῈ ὈΝΙΝῈ 19/2, 
In order to make use of the lemma, we require upper estimates for the 


numbers e, and ¢,. Such estimates can be derived from the formula (1) for 
g(y). It implies that for large v 


jy=a-10""+O(1) and J,=a(v—1)-10°°™ + O(110°" 2”), 
(5) 


This implies that there are two positive constants c and C such that for 
large v, 


εν "10" <i, < Cv: 10". (6) 
Further, for h = 0, 1,..., m, and for large x, 
fix) = O(x”), 
hence, by (5), for all such values of ἢ and for large v, 
Saliv—1 + 1) = O(10"). 
On the other hand, for such ἢ and ν, 
(105: — 1)°@*) — (0Ὁ — τὴ = O(107”). 


Therefore all the sums rp, r,, r,,... are bounded positive numbers, say not 
larger than 10? — 1 where p is some positive integer. This means that for 
sufficiently large integers N the products dyro, ἄν γι: dyrz, ... are positive 
integers not greater than 


190°"* 1)N2 __ 1, 


therefore by (6) that 
O< Σ᾽ dyr,-107*< 10°19", 
v=N+1 
The hypothesis of the lemma is thus satisfied with 
ey=(m+1)N? and ey= 10%" 


Here ἐν > Ney for all sufficiently large N, and hence the lemma leads to the 
following result. 
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Theorem The decimal fraction o(f) belonging to a polynomial f (x) of 
degree m has the following property. To every sufficiently large positive integer 
N there exists a positive integer dy of at most (m+ 1)N(N + 1)/2 decimal 
places such that at least 10%" — (m + 1)N° of the first 10%” digits after the 
decimal point in the decimal expansion of dy a(f ) are equal to 9. 
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The following theorem is proved. If g,, 92, ---» J, is a sequence of elements from a group (written 
additively) such that <g1, 92. ---> 94> is not cyclic, then either the set of all sums g; + 9;, + 
+ 9, (1 <i, <i, <-+++ <i, <k; t = 1) contains 0, or it contains at least 2k — 1 elements. 


1. Introduction 


Given a sequence of elements g,, ..., g, (possibly with repetition) in a 
group G (written additively), we shall ἊΝ that the sequence a ΈΕΗΙΣ g if g 
occurs as a sum of the form 


g=g,t°"'+9, ([Ξὴ «τ «ἢ, κ; 1} 1} 


For finite Abelian groups, the problem of determining the length of the 
longest sequence in the group that does not represent 0 has been extensively 
studied [1-3, 6], and, though solved for Abelian p-groups and for a number 
of other cases, remains unsolved in the general case. Much less is known for 
non-Abelian groups. In this paper we shall prove the following result for an 
arbitrary group. 


Theorem If the sequence g,,..., θκ does not represent 0, and if the group 
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(Gis +++» 9x» generated by the g; is not cyclic, then the sequence does represent 
at least 2k — 1 elements. 


From this theorem we shall obtain the 


Corollary If G is a finite, but noncyclic group of order n, then every 
sequence in G of length k > 3(n + 1) represents 0. 


The corollary was proved for Abelian groups by R. B. Eggleton and P. 
Erdos [4]. In the same paper, Eggleton and Erdés proved that any sequence 
of k distinct elements in an Abelian group, which does not represent 0, does 
represent at least 2k — 1 elements. 

It is interesting to note that our theorem and corollary give the best 
possible results in the case of dihedral groups, and that this is so even if we 
take sequences of distinct elements. To see this, let G be the (dihedral) group 
of order ἢ = 2v generated by a and b, where 2a = 0, vb = 0, and b+ a= 
a — ἢ. For k < v, form the sequence 


a, a+b, a+2b, ..., a+(k—1)b. 
It is easy to see that this sequence represents exactly 2k — 1 elements, namely 
a, a+b, ..., a+(k—1)b; b, 2b, ..., (kK —1)b. 


More generally, the theorem gives, for a given k, the best possible result 
in any noncyclic group G that has an element x of order at least k. For if 
y €G, but y € <x), the sequence 


5 ES, AR, (where x is repeated k — 1 times) 


does not represent 0 and does represent exactly 2k — 1 elements. 


2. Notation and Preliminaries 


If A and B are nonempty subsets of a group G, take A + B to be the set of 
all elements of the form a+ ὃ with ae A, b ε B. The notation |X| will 
signify the number of elements in a finite set X. If a,, ..., a, are nonzero 
group elements, the symbol Σ = X(a,, ..., a,) will always denote the sum set 


= {0, αι) ἘΠ {0, a,} εὐ δου {0, a,}, 


that is, the set of all elements ¢,a, - τ + &,a,, where ε; Ξε Ο or 1. Thus 
ΟΕΣ. If ea, +-°--+&a,=0 (δ; Ξξ Ο or 1) has no solution other than 
δι =°*: = & = 0, we shall say that Ὁ has only the trivial representation in =. 
By a coset of a subgroup H we shall always mean a right coset H + g. If 
contains an element of a coset H + g, we shall say that 2 represents that 
coset. 
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We shall require the following lemmas: 


Lemma 1 [f 0 has only the trivial representation in Σ = X(a,, ..., a,) 
then |X| >k +1 and equality holds only when a, = a, = """ = ay. 


3 


Proof The lemma is easy to verify for k = 1, 2. Assume that k > 3 and 
that the lemma is true for sequences of length k — 1. Since 0 has only the 
trivial representation, 0 ¢ a, + X(a2, ..., αι). Therefore [Σ] >1+ |X(a), 
..., @,)|. The lemma follows, by induction, from this inequality except when 
a, =a; ="''=a, and a, # a2. But, by symmetry, |2| >14 [Σία,, ... 
a,-,)|. Hence, in the remaining case, |X| >k + 2. 


3 


Lemma 2 Suppose <a) is a cyclic group of order n and S = {0, a, 2a,..., 
ta}, whereO <t <n—2.1f0 #he <a), then |S + {0,h}| >t +2. 


Proof Ifh lies in S, then —a+heS, but —a+hé¢S +h. If not, then 
heS+h but ἢ ἐ 5. Thus, in any case, 5. # δ +h. Since S + {0, h} = δ 
S +h, we have |S + {0, h}| > [5] =¢ +1. 


Lemma 3. Suppose <a) is a cyclic group of odd order n and S = {0, a, 2a, 
..., tal, where 2<t<n-—1. If he <a), h is not a generator of <a), and 
O¢S +h, then |S + {0,h}| >t +4. 


Proof Since <a> has odd order and ἢ is not a generator, h + a, 2a. Also 
ἢ # O since hisin S + h but 0 is not. Thus h = ra, where 3 <r <n — 1. Since 
S +h = {ra, (r+ 1)a, ..., (r+ t)a} and 0¢S +h, we must have r+t< 
n — 1. Hence 0, a, and 2a are not in S + h. Since these three elements are in S, 
it follows that |S + {0, h}| => |S+h| +3= [59 +3=¢4+4. 


Lemma 4 (Kemperman) Suppose A and B are finite subsets of a group 
απά Ὁ ΕΑ τ B. Ifa+b=0 (ae A, b ε B) has no solution except a= b = 0 
then |A+B| Σ [4] + [8] -- Ἰ. 


5 


The proof of this lemma may be found in Kemperman [5]. 


3. Proof of the Theorem 


Assume that the theorem is false and let a,,..., a, be a criminal sequence 


of minimal length k. Thus <a,, ..., a,> is not cyclic, | Z(a,,...,a,)| < 2k and 
Ὁ has only the trivial representation in X(a,,..., a,). Let H = <a,, ..., a,>, 
where ¢ is the largest index such that <a,,..., a,> is cyclic. We may assume 


that our sequence is chosen, from among the criminal sequences of length k, 
to have maximal value t. Let u be the largest integer such that <a,_,,41,.--, 
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a,» 18 cyclic. Under these assumptions we first show 
Lemma 5 Let 2 = x(a,, teeny ακ). 


(a) [Z| -- 2Κ -- 1, |Z(a,,..., q_1)| = [Σῴα, ..... a,)| = 2Κ -- 2. 

(Ὁ ΗΈΣΞΣ. 

(c) H is a finite subgroup of odd order. 

(4) Σ is the union of at least three cosets of H. 

() u<t<k—4andt<|H| -- 2. 

Proof We must have a, € (a,..., ἄχ). for otherwise a, + L(az,..., ay) 
and X(a,,..., ἀκ) are disjoint sets, and so |X| > 2|X(a2,..., a,)|. But this is 
impossible since, by Lemma 1, |X(a,,..., a,)| Σ k. It follows, in particular, 
that <a2, ..., a> is not cyclic. Thus, by the minimality of k, |Z(a,,..., 
a,)| = 2k — 2. Since 0 has only the trivial representation in Σ, 0 ¢ a, + 
Σία,, ..., a). This shows that |Z| > 1+ |ZX(a2, ..., αν}. Thus equality 
must hold in the last two inequalities. Similarly | X(a,,...,a,—,)| = 2k — 2. 
This proves (a). 

Since |X| = 1+ |ZX(a,,..., a,)| and 0 ¢ a, + Σία;, ..., a,), we must 
have that © = {0} ὦ a, + Lay, ..., a,). Thus if x eX and x #0, then 
—a, +x € X{a,,..., a,). In particular 


x eX, x#0-> --αι ἜΧΕΣ. (1) 
Similarly, 
ΧΕΣ, x#0>-x-a ΕΣ. (2) 


From (1) it follows that if x e Σ but x ¢ <a,>, then x, —a, + x, —2a, + 
x,... all lie in &. Since 2 does contain elements not in <a,>, we see that a, 
has finite order. Hence 


ΧΕΣ, x € (a, >> (a, t+ xz. (3) 


We shall show that the restriction “x # 0” in (1) can be eliminated, or, in 
other words, —a, ΕΣ. To do this we need to know that a, # αι. We shall 
show that a, ¢ <a,>. Suppose, to the contrary, a, Ε <a,>. Consider the se- 
quence a,, 41. 41. ..., @— 1. Clearly 0 has only the trivial representation in 
X(a,, 4;, 22, ..., @—,). Let γε X(ay, ..., a_,). If y εἰ (αι), then a, and y 
commute, soa,+y=y+a,e. If y ἐ <a,), then, by (3), <a,> + y SZ, so 
a, +y ΕΣ. This proves that XL(a,, a,,..., ακ-- 1) Ξ Σ. Therefore a,, a,,..., 
a, —, 15 acriminal sequence of length k and <a,, a,,..., a,> is cyclic. But this 
contradicts the maximality of t. Thus a, ¢ <a,>. 

By (1), —a, + a, € Σ. Since a, # a,, we have, by (2), that (—a, + ἀμ) — 
a, ΕΣ. Thus —a, ΕΣ. Thus (1) becomes: x € -» —a, + x € XZ. This gives 


{ay > +X =2. (4) 


Since the elements a,,..., a, Commute, we may interchange a, with any a, 
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(j = 2,..., ἢ), So (4) holds with a, replaced by a;. Thus ¢a;) + Σ = Σ for all 
j=1,. a It follows that 


H+ Z=¥%. (5) 
We see, by (5), that H is finite and that Σ is the union of complete cosets of 
H. sag || = | H|m, where m is the number of cosets of H represented by 


Σ. By (a), |X| is odd, hence |H| and mare odd. But m > 1, so m > 3. This 
proves (b ) (c), and (d). 


Clearly —a,,..., —a,, —a, isa criminal sequence, so, by the maximality 
of t, t>u. By Lemma 1, [Σία,, ..., a,)| >t+1. Thus t+1< |H|. If 
t= |H| —1, then, by Lemma 1, L(a,, ..., a,) =H. But this means that 
H + X(a,,..., a;) = Σί(α,, ..., aj), for all ε <j < k. Hence |H| divides |X(a,, 


.., aj)| for ali t<j<k. But this is impossible since t<k, [Σία,, ..., 
αι. )| = 2k — 2, and |X| = 2k — 1. Thus t < |H| — 2. 

Since 2k —1= |X| >3|H| >3t+6, we have k>t+4. This com- 
pletes the proof of (6) and the lemma. 

Lemma 6 Either (i) there is an index s such that 

|X(a,,..., a,)| > 2s +t, (6) 

or (11) t = 1 and a, = 453 = (4. 

Proof By Lemma 5, Σ represents at least three cosets of H. Let v be the 


smallest index such that X(a,, ..., a,) represents three (or more) cosets of H. 
Thus v >t + 2. Let δ = X(a,, ..., a,). If w ε X(a,44,..., a,), then S+ we 
X(a,,..-, a,). Thus Z(a;, ..., a,) contains at least |S| elements in each coset 


it represents. Since X(a,,...,a,) represents at least one more coset than X(a,, 
.., Qy_1), We have 


|X(a,,..., a,)| = |X(ay,..., a,-1)| + [S| 2 2υ -- 2 - [δ]. 


By Lemma 1, [S| >t + 1. If [S| >t + 2, we are done. Also we are done if 
X(a,, ..., a,) represents four or more cosets of H, for then 


|Z(a,,...,a,)| >2v—24+2|S| >2v-24+2(t+1)>2w4+t. 


By Lemma 1, |S| =¢ + 1 only when a, = a, ΞΞ """ =4,. 

Thus we may assume that a, = a, = """ = a, = a, hence S = {0, a, 2a,..., 
ta}, and that X(a,, ..., a,) represents exactly three cosets of H. 

Let x = a,4,. By the meaning of v, X(a,,...,a,-,) ¢ H UH +x. Thus 
we may write X(a,,...,a,_;) = A ὦ B+ x, where S ς A, BC H. The third 
coset represented by X(a,,...,a,), but not by X(a,,...,a,_,), is either H + a, 
or H+ x ταν. Thus if |A| >t+2 and |B| >t + 2, we are done and (6) 
holds with s = v. For notational convenience, we shall label the first t + 3 
elements of our sequence (recall k > t + 4 by Lemma 5) as a, ..., a, x, y, z. 
We consider five cases. 
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Casel v>t+4 In this case X(x, y, 2) Ξ Η ὦ H + x. Since the three 
elements y, z, and y + z lie in H ὦ H +x, at least one of them lies in H. 
Label that element h,. Clearlyh, #0. Thus x + h, € X(x, y, 2) Nowx + hy, 
is not in H, hence x + h, = h, + x, where 0 τ ἢ € H. Thus A 2S + (0, h,} 
and B>S + {0, h,}. By Lemma 5, t < |H| — 2, hence Lemma 2 gives | A], 
|B) >t4+2. 

Case2 v=t+3andx#y Thus 2(a,...,a,x,y)=AUB+x. Since 
x € H, we have H+x+y+#H-+4 y. Thus one of x + yand y lies in H and 
the other lies in H + x. Since neither is 0 or x, they have the form h,, ἢ. + x, 
where h,, hh, #0 and h,, h, ε H. Thus A 2S + {0,h,} and Β 3 5 + (0, hj}. 
Again |A|, |B| >t + 2, by Lemma 2. 

Case3 v=t+2andx#y _ The four elements 0, x, y, x + y are dist- 
inct, and Σία, ..., a, x, y) represents exactly three cosets of H, hence some 
two of these four elements lie in the same coset of H. Thus X(x, y) contains 
two elements of the form w,h+w, where 0#heH. By Lemma 2, 
|S + {0, h}| >t + 2, hence at least t + 2 elements of Σία, ..., a, x, y) lie in 
H + w. Since the other two cosets each contain at least t + 1 elements of X(a, 
..., a, X, y), we have |X(a, ..., a, x, y)| = 3t+4=2v +t. Thus (6) holds 
with s = v. 

Case4 v=t+3andx=y In this case x + y = 2x =h hies in H and 
h + 0. Since a, ..., a, x> is not cyclic, ἢ is not a generator of H = <a). Also 
0 ἐ 5 +h since 0 has only the trivial representation in Σ. 

If t > 1, then (since H has odd order, by Lemma 5) Lemma 3 applies, and 
we have |S + {0, h}| >t + 4. Hence |2X(a,...,a,x, y,z)| > 3t+6 = 2v + t, 
and so (6) holds with s = v. 

If t=1, then A=S + {0,h}={0,a,a+h,h}, and [4] - 4, since 
h+#0, +a. If z¢H, then [4 U A+2z| >8,and so |X(a, x, y, z)| Σ 10. If 
z € H, then 0 ¢ A + z, since 0 has only the trivial representation; therefore 
| A + {0, 2}} 5 |A| + 1 = 5. Thus Σία, x, y, z) contains five elements of H, 
and so |Z(a, x, y, z)| > 9. Thus in either case (6) holds with 5 = v = 4. 

Case5 v=t+2 and x=y Thus La, ..., a, x, x)=SUS+ 
x U δ᾽ + 2x, where 0, x, and 2x are in different cosets of H. If X(x, x, z) 
represents five or more cosets of H, then | Z(a,..., a, x, x, z)| > 5(t+ 1)> 
2(υ - 1) +4. If X(x, x, z) represents four cosets of H and t > 1, then [Σία, 
...,@,X,x,z)| >4(t + 1) > 2(v + 1) +t. So, in either of these two cases, we 
may take s = v + 1 in (6). 

Assume t = 1 and (a, x, x, z) represents exactly four cosets of H. We 
may assume that z # x, for otherwise statement (ii) of the lemma holds. 
Since z # x, X(x, x, 2) contains at least five elements by Lemma 1. Hence 
two elements of Σίχ, x, z) lie in the same coset of H. Thus w,h+ 
w € X(x, x, 2), where 0#heH. Since a has odd order, S + {0, h} = 
{0, a, h,a+h} contains at least three elements. Thus H + w intersects 
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L(a, x, x, z) in at least three elements. It follows that |X(a, x, x, z)| =>3+ 
24+2+2=9, hence (6) holds with s=v+1=4. 

There remains only the possibility that X(a, ..., a, x, x, z) represents 
exactly three cosets of H. Thus the three cosets H+z, H+x+2z, 
H + 2x + z coincide, possibly in different order, with H, H + x, H + 2x. If 
δ  ἰχ Ἐ ΖΞ δ + jx for all 0 <i, j < 2, then each coset H + ix (i= 0, 1, 2) 
intersects Σία, ..., a, x, X, z) in at least [5] + 1 =t+ 2 elements, and so 
|X(a, ..., a, x, x, z)] > 3t+ 6 = 2(t + 3) +t, which means that (6) holds 
with s=t+3=v+1. Assume that S + ix + z=S + jx for some pair i, j 
(0 <i, j <2). Then S=S + (ix + z — jx). Then by Lemma 2, ix + z— 
jx =0, so z= (j —i)x. Since the three elements z, x + z, and x+x+2z 
cannot be zero, we must have z = x or z = 2x. In either case 3x € X(x, x, 2), 
3x =he H,0 €S +h, and, since <a, x) is not cyclic, ἢ is not a generator of 
H. 

Ift > 1, then, by Lemma 3, |S + {0, h}| >t + 4. Hence H intersects X(a, 
..., @, X, X, Z) in at least t + 4 elements, and therefore [Σία, ..., a, x, x, 
z)| >3t+ 6=2(t + 3) +14. Thus (6) holds withs =t+3=v0+1. 

If t = 1, we may assume z + x (otherwise (11) holds), so z = 2x. Then by 
Lemma 1, |&(x, x, 2x)| > 5. Buta ¢ <x), soa + X(x, x, 2x) and X(x, x, 2x) 
are disjoint. Thus [Σία, x, x, 2x)| >2|2Z(x, x, 2x)| > 10, and again (6) 
holds with s = 4 = v + 1. This completes the proof of the lemma. 


It is now easy to obtain a contradiction from all of this. Suppose first that 
statement (i) of Lemma 6 holds. Thus there is an index s for which | Z(a,,. 
a,)| Σ 25 +t. We may assume, of course, that 5 <k, thus Σ = Σία,, ..., 
as) + X(a,+1,--., @). Since Ὁ has only the trivial representation in Σ, 0 has 
only the trivial representation as a sum 0=a+b, αε Σία,, ..., a,), 
δ € X(a,41,---, a). Thus, by Lemma 4, 


ΝΣ » ΙΣία,, ..., a,)| + ΣΡ ἜΤ a,) | carer 1. 


eg 


ud 


If <a,41,---, @)> 1S not cyclic, we have |X(a,41,..., a,)| = 2(k -- 5). hence 
ΙΣ] >2k +t -- 1 > 2k, and we are done. If <a,,,,..., a> is cyclic, then, by 
the definition of u,k — s <u. Also |X(a,41,...,a,)| =k —s + 1, by Lemma 


1. Thus |2| >s+k+t>2k +t —u > 2k and we are done. 

Suppose that statement (11) of Lemma 6 holds. Thus t=1 and 
a, = a, = a,. Let r be the largest integer such that (a,,a3,...,a,4;> =Qis 
cyclic. Thus r 3. Alsor+1<k since u= 1. 

We first show that X(a,,..., a,) represents at least three cosets of Q. The 
elements 0, a, + 4), —a, + a, lie in Σ, by (Ὁ) of Lemma 5. By (a) of Lemma 
5, there is only one element in Σ that is not in X(a,,..., a,). This element 
must be —a, since —a, ΕΣ by (b) of Lemma 5, and since —a, ¢ X(ap,..., 
a,) since 0 has only the trivial representation in ©. Now 0, a, + a3, 
—a, + a, are distinct from —a, since a, has odd order and a, ¢ Q. Thus 0, 
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a; + az, —a,; +a, € X(az, ..., αι). Therefore X(a,, ..., a,) represents the 
three cosets Q, Q + a, + a,, Q — a, + ay, which are distinct, again since a, 
has odd order and a, ¢ Q. 

Let v + 1 be the largest index such that X(a,,..., a,,,) represents three 
or more cosets of Q. Since a, = a3 = a, € Q and |X(az, a3, a4)| > 4, every 
coset of Q represented by X(a,, ..., a,4,) intersects X(a,, ..., Ω,. 1) in at 
least four elements. Thus |Z(a2, ..., a,41)| =4+ [Σ(α2, ..., a,)|. Since 
v>r-+ 1, <ap,...,a,> is not cyclic, hence |Z(a2,...,a,)| > 2(v — 1), and so 
ΙΣ(α;, -... Ay41)| = 2(v + 1). By Lemma 5, |Z(az, ..., a,)| = 2k — 2, so 
v+1<k. Lemma 4 gives 


|Z(a.,...,a,)| = 20+ 1+ |Lla,i2,..., a)|- 


If (ay42, --., a> is not cyclic, then |Z(a,,.,..., a,)| = 2(k — v — 1), and 
hence |2(a2,..., a,)| = 2k — 1, which is a contradiction. If (a,,,..., a,> is 
cyclic, then k —-v —1=u=1 and so |X(a), ..., a,)| = 2k — 1, which is 
again a contradiction. This completes the proof of the theorem. 


To prove the corollary, suppose a,, ..., a, 15 a sequence in a noncyclic 
group G of order n, and k > 3(n + 1). If the sequence does not represent 0, 
then |X(a,, ..., a,)| >k+1>n/2 by Lemma 1. Hence <a,,..., a, =G. 
But then, by the theorem, |Z(a,, ..., a,)| = 2k >, which is impossible. 
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Concerning a Possible “Thue—Siegel—Roth 
Theorem” for Algebraic Differential Equations 
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Let K denote a field of characteristic zero. Let y, denote a formal power series about z = 00 with 
coefficients in K which is a solution of a nonzero algebraic differential equation having its 
coefficients in K. Let ord denote order of vanishing at z = 00. We show that for each ε > 0 there 
exist a constant c(&) > 0 and a nonzero algebraic differential equation A ,, having coefficients in Καὶ, 
such that for all r, s = 0 in K[z] either, (1) ord(y, — rs~') < (2 + €) deg 5, (ii) deg s < c(e), or (iii) 
rs—' is a solution of A,. In a number of cases both c(e) and A, are effectively computable. If one 
could eliminate case (iii), a very strong statement of approximation would follow, a statement 
analogous in many ways to the Thue-Siegel-Roth theorem. Inequality (1) is best possible in the 
sense that, as one may easily show, given any formal series y, with ord y, =0 and any positive 
integer n there exist polynomials r and 5, with deg s <n, such that ord(sy — r) > n > deg 5. 
(Consider the rank of the associated system of linear homogeneous equations.) 


Introduction 


Kolchin, in [2], suggested the problem of proving an analogue of the 
Thue-Siegel-Roth theorem for (at least) all formal power series y,, with 
coefficients in a field K of characteristic zero, that satisfy an algebraic differ- 
ential equation. He obtained what may be thought of as a “Liouville type” of 
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bound on the “diophantine approximation” of such formal series by rational 
functions. In two recent papers [3, 4] the present author used the methods of 
differential equations in order to obtain effective bounds on the “diophan- 
tine approximation” of algebraic functions over fields K of both zero and 
positive characteristic. The present results represent a considerable generali- 
zation of the methods used by Osgood [3] and Osgood [4]. We obtain below 
an incomplete solution to the Kolchin problem, as we understand it and 
have interest in it; i.e., for each power series y, and every ¢ > 0, our analogue 
of the Thue~Siegel-Roth theorem holds, for all rational functions not satisfy- 
ing a certain algebraic differential equation, P, = 0. Here P, + 0 is a differ- 
ential polynomial depending upon both the differential equation satisfied by 
y, and upon 8. In many cases P, is effectively computable. 

Since our method could be used to construct a number of (potentially) 
distinct differential polynomials P,, one would hope that future work in this 
area might show} (sometimes even effectively) that beyond some point the 
only common zeros of all of these differential polynomials P, are the solu- 
tions to our original algebraic differential equation for y,. Then if, say, y, is 
not a singular solution of our original differential equation for it (which we 
shall have to assume below anyway) the argument of Kolchin in [2] would 
suffice to complete a (noneffective) proof of our analogue of the Thue- 
Siegel-Roth theorem. The effective approach used in [3] by the present 
author might then even allow one to obtain this latter result effectively, in 
certain cases. 

Finally, the approach of this paper can be carried out in more general 
contexts. In particular, in Theorem II below, we consider what can be shown 
over an arbitrary differential field of characteristic zero. We have no interest 
here in other valuations than ord. Extensions to other valuations are likely. 
(Presumably, with minor changes, the proof would also go through for fields 
of positive characteristic. This is of less interest, perhaps, since there exist 
algebraic differential equations, such as y'?) = 0, which have in their solution 
set every formal power series with coefficients in any field of characteristic 
p> 0.) 


Definitions Let K denote a field of characteristic zero. Let z be tran- 
scendental over K. Let A denote a nonzero algebraic differential equation 
over K, ie, A is of the form a(y, γί"), ..., y™, z)=0 where each 


+ (February 1976) As stated, this now appears unlikely to me; however, it would be enough 
to show that y, is not a limit point (in ord) of the set of common zeros of all of these differential 
polynomials. 
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γ᾽ = (d/dz)y, 0 <j < οὐ, nis a nonnegative integer, and α(ν, y?, te γι", Ζ) 
is a not identically zero polynomial in n + 2 variables with coefficients in K. 
Let y, denote a (generalized ) formal power series solution of A about z = oo, 
1.€., Ne 


7 x 
y= Laz" 
1=0 ; 
for some integer m and a set of elements a, in K. Suppose that y, is not a 
solution of da/dy” = 0. Let ord f denote, for any formal power series fabout 
z = oo, the order of vanishing of f at z = oo. 


Theorem I (i) For each ¢ > 0 there exist a nonzero algebraic differen- 
tial equation A, with coefficients in K and a positive constant c(&) such that 


ord(y, -- γ5 ἢ) « (2+) deg s (1) 
for all polynomials r and s + 0, in z, with coefficients in K, such that rs“ ' is not 
a solution of A,, and deg 5 > c(é). 

(1) If K equals the rational field, then A, is effectively computable, and if 


we know, in addition, a lower bound on ord y,, then c(é) is also effectively 
computable. 


The following theorem strengthens Theorem I in a rather surprising 
manner. 


Theorem I' Suppose, under the hypotheses of Theorem 1, that addi- 
tionally ord y, > 0, then Theorem I holds (for a potentially different A,) with 
c(e) = 1. If K = Q, the rational field, and if there exists an effective algorithm 
for deciding if y, satisfies a given algebraic differential equation with 
coefficients in Q, then our new A, is effectively computable. 


(To handle the general case multiply y, above by z~™”, where 
—M <ord y,, and apply Theorem I’ to z γι, where it is being approx- 
imated by z “rs~'.) 

Theorem I’ holds out the (remote) possibility that the constructions in 
the present paper might someday be used to prove an inequality of the form 
ord(y, — rs~') < 2(deg 5) + n(y,), for some n(y,) > 0 and with no excep- 
tions, since one may now choose ε > 0 after a specific rs" 1 has been chosen. 
On the other hand, if deg s > 1, απά we choose € < (deg s)~', then (1) would be 
false infinitely often (see the abstract). The (hopefully not usual) case that rs~ ἢ 
satisfies A, is forced upon us under these circumstances. 

Theorems I and I’ will follow easily from a more general result: 


Definitions Let F denote a differential field, of zero characteristic with 
derivation 6. (For these definitions, see Kolchin [1].) By a nonzero differen- 
tial polynomial P over F, we mean an expression of the form P(y, γ᾽), ..., 
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y) where y is a differential indeterminate (see Kolchin [1] again) y = dy, 
0 <j < οὐ, the parameter n is a nonnegative integer, and P(y, y"?, ..., y™) is 
a polynomial in n + 1 variables over F. (The extension of this definition to 
more than one differential indeterminate is straightforward.) 


By the “denomination” of a differential polynomial P = P(y, y, ..., y) 
over F we shall mean the smallest nonnegative integer d = d(P) such that 
y*P(wy”*, ..., (wy 1)) is a differential polynomial in the two differential 
indeterminates w and y. By the principal differential ideal (in Fly, γί"), ..., 
y”, ...]) generated by a differential polynomial we shall mean the smallest 
(algebraic) ideal in F[y,..., y, ...] that both contains « and is closed under 
differentiation (as we shall refer to the action of the derivation). We denote 
this ideal as («). Similarly, if our ring is F[y, y?, ...; X,, X9P, ...5...5 ΧΙ, 
XW, ...] where y, X,, ..., X, are distinct differential indeterminates, we 
denote the corresponding ideal as (a)F[X,, ..., X,]. 


Theorem II (i) For each nonzero differential polynomial a = a(y, ..., 
y™), with da/dy™ +0, and each & > 0, there exists a positive integer @, a 
nonzero differential polynomial P, € Fly, y‘”, ...], and a real number 0, > 1 
satisfying d(P,)0, ' <2 + ε, such that each 


ΠΣ 


for all sequences of nonnegative integers eg, e,,... with δ᾽; e i < Ὁ. 
(ii) Further, one may effectively compute P, if F is the field of rational 
functions in one variable with rational (number) coefficients. 


We wish to see that Theorem I follows from Theorem II. By Theorem II, 
given ὃ > 0 there exists (or we may effectively compute) a differential poly- 
nomial P,,. such that each 


2) a) 


Σ᾿; 6, < 9,;2, where θ.,2 is such that d(P,,.)(0.,2) | < 2 + /2. Then we see 
that the (finite) Taylor series expansion of P,. in powers of (y — γι), 
(y — yi)”, ... has zero coefficients for all terms of total degree less than 0, 
in (y— y,), (y—y,), .... Let A, be P,. = 0. If 


Py2((r/s), (1/s)™, ...) + 0 


then, since ord of a nonzero rational function is at most the degree of the 
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denominator, 
ord Ρ τίν 5, (r/s)”, ...) < d(P.,2) deg 5. 
Suppose ord(r/s — y,) > 0 and 
ord(r/s — y,) > (2 + ε) deg s > (d(P,;2)(0,;2) * + 38) deg 5. 
Then we would have that 
6.2 ord(r/s — y,) > d(P,,2) deg 5 + 5e(deg 5) 
> (ord P,,2(r/s, (r/s), ...)) + ze(deg s), (2) 


which leads to a contradiction for all s of sufficiently high degree. The bound 
on the degree of s depends upon a lower bound on the order of vanishing at 
z= οὐ of the coefficients of the (finite) Taylor series for P,,. in (y — yj), 
(ν — y,), .... If P,/2 is effectively computable and if we can bound ord y, 
from below, we can place an effective upper bound on the degrees of those 
polynomials s for which there is no contradiction. This proves Theorem I, 
assuming Theorem II. 

We shall next see that Theorem I’ also follows from Theorem II. Where 
a = is the original differential equation for y, suppose 0a/dz = 0. Then, 
taking F = K in Theorem II, we have éP,/dz =0. If, also, ord y, >0 
inequality (2), above, now implies a contradiction regardless of the degree of s if 
deg s > 1. We shall next show that there exists β + 0 in («) with 0B/dz = 0. 
[Since y, is a zero of B, obviously B ¢ Κα. One may then proceed in our 
present case to take successive partial derivatives of this 8, with respect to 
the highest derivative of y appearing in it, finally arriving at a (possibly) new 
B= Bly, y, ..., y™) such that B(y,)=0 but (06/dy™)(y,) + 0.] Our 
procedure below will be effective under the hypotheses of Theorem I’. We 
proceed. by induction on N = deg, «. If N = 0, there is nothing to prove. Set 
a=) ji z/a;, where N > 1, each da,;/dz =0, and ay +0. If ay € (x) we 
have nothing to prove. If ay ¢ (a), then aay’ ¢ K (otherwise ay = a(aya™ ') 
would be in (α)). Using the chain rule, we see d(aay ')/dz has degree in z at 
most N — 1 and is nonzero. Thus αὖ d(aay ')/dz + 0 is in (α) and has degree 
in 2 at most N -- 1. This proves Theorem I’. 


Let u, v, and y be differential indeterminates. Let ¢, be chosen, 
0 «ε, < 1. Inthe remainder of Section 2 we shall show how one may construct, 
for each sufficiently large positive integer 0, a polynomial Q in the 26 variables 
uu, ..., u®- Pv, vo, ..., v8, with coefficients in F, having the following 
properties: 


(i) The total degree of each monomial of Q inu,...,u%~, v,..., v@~ Ὁ is 
at most 20. 


228 CHARLES F. OSGOOD 


(ii) There exists a positive integer p such that at v’ = (uy), j = 0, 


1,..., each 
Oa \°/ oa \°° ἜΣ 
ES) EY θεω 


if 
θ-1 
j=0 
Lemma 1 The number of solutions in nonnegative integers of 
Y 
Σ, δ) Ξ 8 
j=1 
for positive integers y and B is (’*2;"). The number of solutions of 
Y 
Σ 6;,<B 
j=l 
is 075) 


Proof By placing y — 1 “cuts” in a sequence of B marks we define a 
unique solution above. This is equivalent to choosing y — 1 marks out of 
y + B — 1 marks to be “cuts.” Letting 6,4, be 


β- Σδ 


we see that ("7 δ) is the number of solutions of 


Lemma II_ If « has total degree d>1 in y, y",..., y, ..., y™, where 
n > 0, then for each positive integer @ every 


2(@- 1) 
[1] γ. Vaya, 
y 


may be written modulo (a) as a linear combination over F of monomials in y, 
., y”, each having total degree < 46d. 


Proof If j=0, 1, ..., n, there is nothing to prove. Let 6, denote the 
derivation which agrees with 6 on F and is such that 6, y) = 0, j = 0, 1,.... 
Now if n + 1 <j < 0 we see that, ἠδὲ the chain rule j — n times, 


n1T Oa.\ δα 
j = y — [Σ Σ γι: ᾿Ξ δρῶ" ΝΠ dX (a a ae 1) maT 


dy 
Oa \~ 0 a 
“ay " Jet) 7 
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while not even in Εν, γι"), ..., y, ...] is such that 


Co 2(7----1 
(5) ΓΙ e (x). 


Then each 


2(@— 1) 
(5 y), 0<j<0-1, 


may be written modulo («) as a linear combination over F of monomials in 
y, yD, ..., y having total degree less than 40d. This proves Lemma II. 


Using Lemmas I and 11 we wish to see that we can construct our above- 
mentioned polynomial Q. We need to solve a system of homogeneous linear 
equations with coefficients in F, nontrivially. Therefore, we must show that we 
have more variables than equations. Our variables are the coefficients of the 
distinct differential monomials in u, ..., u@~, v, ..., v°~ (potentially) 
appearing in Q. We have (7°3,2%) = (36) such coefficients by Lemma 1. The 
number of distinct monomials in u, ..., u®~’ which we have upon setting 
v™ = (uy) after O<j<O@(l1—¢,) partial  differentiations, ἰδ 
(79— J+) = (3% 4). The number of distinct partial derivatives of order j = 1 is 
less than (6 ἢ by Lemma 1. Thus the total number of monomial coefficients 
which need to be equated to zero is at most 


a1 61) (30 —j\ [0-} 
Σ (Pe 7") 
j=0 J 
Set p = 20(20 — 1) < 46? and apply Lemma II. Equating each of these mono- 
n+1+ 802d 


mial coefficients to zero leads to at most ("τὶ -) homogeneous linear equa- 
tions with coefficients in F, using Lemma 1. 


Lemma ΠῚ For all θ larger than some effectively computable integer, 


(n+ 1) + 807d os 36 —j\(O+j 2 40 
n+1 j= θ j 20 
Clearly Lemma III will suffice to show the existence of Q. If the field F is 
the field of rational functions over the rational numbers, we may solve the 
above system of linear homogeneous equations effectively by putting the 
matrix of coefficients in triangular form.t Clearly a lower bound on θ may be 


effectively computed above in terms of δ, and bounds on the order of a and 
the degrees of α in y,..., y”,..., y™, for all differential fields F. 


+ That is, where (a, ,) has rank n, each a, , = 0 if i >n ori <j, while a, ,---a,, £0. 
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Proof of Lemma III If @ is sufficiently large 


(n+ 1) + 867d = g2nt3 
n+1 


Σ, " - ' (’ + 
ἼΞο \ 9 J 


will be shown to be less than 


0 OQ 7 


where j, 1s the least integer such that j, >, 0. First 
30 —j\(8+j\  (30—-(j—1)\(0+j—1\(26-j + 16+) 
0 6] θ θ (30 ~j+ 1)j 
prays her! 1 4 220 τ1--ἢ 
7 θ θ (86 —j + 1)j 
30 —(j—1)\/@+ )-1 
[7 


for j = 1, 2,..., θ. It follows that 


μ᾿ 30-- j\(O0+j 

Re 

» +) (8 i) ete) i ῃ i) [ τ: ) 
[γ 1 — j}(20)~ 7)? 


Now 


< 01 — Ges)? ¥(5) 


From the expansion of (1 + 1)?", we see that 


2S [ἢ > (2n + 1) 127". 


20\* 4, . (40 
fl <2 < (3) 49 τ ἢ 


Thus 
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from which we see, 


were -- β00}}}2] < 50790 - βου». 


Lemma III now follows. 


In what follows u, v, y, and Y are differential indeterminates. Now 
Q(Yu,..., (Yu)®"», Yo, ..., (Yo)? ) 


may be rewritten, where ep), ..., €g. 1 are nonnegative integers, as 


9--1 
Σ. οοἠἰδοινοςον δὴ ὴκοδδοοο ἀθρόον 1] Ὁ}. 
j=0 


(€0, .--, €a-1) 


Notice that at 
v=uy, ..., υϑοῦ - (}ν)ὴ } [orv=uy], 


each 
θα Ψ θ--.Ἠθ᾿, θ- 1 
ay q(eo greey Co1,U,..., u' : Dy +++, v' ’) Ε (α) ΕΓ] 
since, at v = uy, 
oa \® (9-1) (@— 1) 
spin) OY... (Yay, Yo, (70). ε (@) FLY 
c (a)F[u, Y]. 
Lemma IV_ If 


6-1 
Σ, ἢ Ξ O(1 -- ει} 
j=0 


δα \? ὃ. \fo ὃ fo-1 


X Glo, +++) Cp_-4, Uy... UP, v, ..., VO?) € (a) F[ul, 


each 


at v = Uy. 
Proof Lemrna IV follows if upon applying 


Ca \? ὃ. \fo ὃ 59-ἰ 
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to 
QO(Yu,..., (Yu)@~?, Yo, ..., (γνγθτ 9) 


and setting v = uy, 
we obtain an element of («)F[u, Y]. Using the chain rule for partial differen- 
tiation, we see that what we have above is 


δα 5 9! (91 (ky ὃ VS 
Wendie (KJ)  ! ὃ΄όΠ 
i I ΙΣ (| )y oP) 


x O(Yu,..., (Yu)@"”, Yo, ..., (Yo) 9). 
This last quantity belongs to 
(a)F[u, Y] at v=uy 


since, if 


yy < (1 — ει} 


for each choice of nonnegative integers k,, 


δα \9 91 / ὃ γ 
(si) 10 (aoe) 
x O(Yu,..., (γι) 6: ἢ), Yo, ..., (Yo)@") ε («)F[Yu] at v=uy. 


This proves Lemma IV. 


We shall next find P, and show that it has the desired properties—thus 
concluding the proof of Theorem II. From Lemma IV we see that each 
nonzero 


6-1 6-1 
Gps ese paGy host Ps cage) 
satisfies the same hypotheses as 
9-1 θ-- 
ΟἿ κὸν oy ceca or 2), 


Further, each is homogeneous, 1e., every monomial has the same total 
degree (in the derivatives of u and of v). Then, without loss of generality, we 
may assume in what follows that Q is homogeneous. We have either 


Q(Yu,..., (Yu)?" ?, (Yv), ..., (Yo) ?) 
= ποίους τ calgo?), 
for some nonnegative integer m < 26, or there exists a nonzero 


6-1 6-1 
Gees ca ὄρει πρρ ες τ ἰβρα ἢ 
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with e, +:*: +e _, > 0. In the latter case replace 
OG... τ 
by a nonzero 
Geos ---» 6ρ-. 1.» tl, aks ie tue =) 


with e, +°*: +e -, >0 and iterate the above argument. After at most 
267(33) steps (the number of differentiations occurring in our original poly- 
nomial Q is less than 267(3$)) we must arrive at a new polynomial Q such 
that 


O(Yin, ..., (σι) τ Ὁ, Yo, .οον (ὑυ)6- 1 
-- Υπο(μ, .... τὸ ie cg OO »), 


for some nonnegative integer m < 20. 
Then we have 


Oeil OY 


= u"Q(1, 0,..., 0; (v/u), ..., (v/uyO ”). (3) 
Replace v/u by y, set εἰ = ξε, and set 
O10... Oy SS Peay SP (4) 


We must check that P, satisfies the conditions stated in Theorem II. Now 


n(x \°(a\% ( ὃ 4! ᾿ 
oC 


du \*(a\e ( ὃ \ert 
~ Vay} Vay) ἰὄγθτυ 


x Olu, ..., u@—), uy, ..., (uy)@ ?) 


9-1 {0-1 ej 
= (eve) LC, Grae) 
oy™ j=0 \k=0 VJ év™ 
x Olu, ..., u®@—Y, v, ..., v®-), 
at v = uy. By the construction of Q the final quantity above is in («)F[u} if 


θ-1 
0 


j= 


It follows that each 


da. \"{ 0 \°° ἘΝ il - 
(S55) (5p) (Gyan) Peeve 
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if 26 e; < (1 — ει). Notice that by (3) and (4) the denomination of P,, 
Le., d(P,), is at most μι < 20. Hence, setting 0, = θ(1 —«,) > 1, we have 
(assuming without loss of generality that 0 < ε < 1) 


d(P,)0,' <2(1—e,)°'<2+3e,=2+¢6. 


If the field F equals the rational functions over the rational numbers, our 
construction was effective. This proves Theorem II. 
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The Minimum Discriminant of Seventh 
Degree Totally Real Algebraic Number Fields 
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The minimum discriminant of seventh degree totally real algebraic number fields is computed. It is 
20 134 393 and is attained only by the field Q(p), where p is a root of the monic irreducible 
polynomial x’ + x® -- 6x> — 5x* + 8x3 + 5x? — 2x -- 1. 


We determine the minimum discriminant essentially in the same way as 
J. Hunter did for quintic fields [1]. Two modifications and a fast computer 
enable us to handle the large number of polynomials to be considered. The 
modifications consist of a better bound for the polynomial coefficients 
(Theorem 1) and of a new method of Zassenhaus [6] for determining integral 
bases. Most of the underlying theory is developed by Pohst [4]. 

Let F be a totally real algebraic number field of degree n = 7. Then 
F = Q(p) holds, where p is a root of the monic irreducible polynomial 


f(x) =x’ +a,x®° +--+ + a6x +a, € Z[x]. (1) 
Moreover, f must have only real roots. We assume that we already know 
that dy = 20 134 393 is a discriminant of some field F. So we shall prove that 


there is no smaller discriminant ἀρ. In the course of our considerations it 
will also become clear how to obtain dy for a start. 
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We must show that there is no polynomial f the root p of which generates 
a field F = Q(p) with a discriminant ἀρ < ἀρ. The underlying idea is the 
following: If such a polynomial f exists, then there is also a polynomial f with 
a root p and F = Q(f), and the coefficients of f satisfy some inequalities 
|a;| Ξ 5; with suitable bounds S; (i = 1, ..., 7); that is, we must consider 
only finitely many polynomials which can be done by a computer. So at first 
we must derive bounds S; which of course will depend on ἀρ. 


Theorem I Let F be a totally real algebraic number field of degree 7 and 
discriminant ἀρ < dy. Then there is a generating element p, F = Q(p), p aroot 
of a polynomial (1) with 

jtr(o)| = Jai] <3, (2) 
tr(p*) <4 Ὁ (Stde)" <3 + (6140). (3) 


From Theorem I we obtain bounds S,, S,, 87 by simple calculations 
involving only the inequality of the arithmetic and geometric means. 


a,€ {0, 1, 2: 3}, 
-ἰ({ + (5 40}} — aj) < a, < —4(8 -- a?), 
[α] < ((aj — 2a,)/7)"", a, $0. (4) 


Unfortunately Theorem I does not yield results for §; with 3 <i < 6. We 
get those bounds from the fact that fmust have only real zeros. Then also f 
has only real zeros and the discriminant of f is positive: 


[43 + gya,(2a? — Taz)| < 72,/4(3a? — 7a). (5) 


Since it is difficult to compute discriminants of polynomials of degrees n > 4, 
we transform the property of finto a statement about a quadratic form. 


Theorem IT Let f € Z[x] be given as in (1). The polynomial f has only real 
roots if and only if the symmetric matrix A = (a;;) € GL(6, Z) with entries 


3 
αἱ; = j(7 —= i)a;a; <= pC + Ϊ — ))αι κα; κ 


(ij lye 6s Tee Gy = 1. 220 ifm> 7 orm<0) 
(6) 
is positive definite. The value of the discriminant D, of f is given by 
D, =7~° det A. (7) 
Theorem II is very important for two reasons. First, we get bounds for 


coefficients of ffrom the fact that the matrix A must be positive definite, i.e., 
all principal minors of A must be positive. Thus the two-row principal minor 
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of A yields an upper bound for a,. If a, < 0 holds, we get a lower bound for 
a, by Newton’s inequalities 


7\? ne 
ve P= 1.36; = 1). 8 
ἢ αν ναι «{ 1 {||} 6 ( ὭΣ. 


For a, = 0 (a, > 0 does not occur), we use the fact that the discriminant of 
f® must be positive too: 

ai -- (Paya + 48at)az — hatada, -- ξξαὶ -- Hatas>0. 0) 
From (9) we obtain a lower bound for a, by determining the smallest zero by 
an improved Newton’s method. An inequality of Hermite provides bounds 
for as: 


(35a; — 9a,a4 + 2a,a3)? < $(21a, — 12a,a3 + 543) 
x (25a,a5 — 20a,a4 + 9a3). (10) 
Further, the three-row principal minor of A must be positive. This 1s equiva- 
lent to 


251 Ga1 Gan — A241 055 — A220 
21931432 31422 324114 
42a, < 1202 — 14a, a4 — 28a, ay + 022231432 — 991422 ~ A32411 


(11) 
Since the four-row principal minor of A must also be positive, we obtain an 
inequality a2 + r,a2 + ra. + rz > 0, where ther,, r,, 753 are rational num- 
bers depending on a,,..., ας, a7. This gives us a lower bound for ας in the 
same way as for a,. So at last we have found estimates for all coefficients of 
the polynomials f and must consider only finitely many of them. 

The second important property of Theorem II hinges on the possibility 
of computing the polynomial discriminants by (7). If we compute det A by 
Cholesky’s method, namely, we examine at the same time whether f has only 
real roots. Furthermore, we need the values D, for determining the discri- 
minants d, of the fields F generated by some root p οἵ ἢ, and ἀρ differ at 
most by a square: 


2 
411422 — ar, 


and ἢ, =d, holds if and only if 1, p, ..., p® form an integral basis of 
F = Q(p). 

We give a short description of the corresponding computer program. It 
Starts with seven Do-loops in which the possible polynomials fare generated 
as linear integer arrays (a,,..., a7). A subroutine determines the polynomial 
discriminants D,. Since many discriminants are greater than 2°* they do not 
fit into one word on an IBM machine. Fortunately, all ἢ, are smaller than 
10'*; we therefore carry out the calculation of D, in double precision. Note 
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that Cholesky’s method is numerically stable. The remaining polynomials f 
have only real zeros and D, > 0. Then another subroutine tests whether (15 
irreducible. (The common algorithms are described by Knuth [2].) All re- 
ducible polynomials fare eliminated. Afterward D, is factored into a square- 
free part and squares of prime numbers. Since D, is a double precision 
number, this has to be done in an adequate way by an assembler subroutine. 
There remain 3984 polynomials f with D , = ab? (a, b € N, a square-free) and 
α «ἀρ. 

The determination of the field discriminants ἀρ corresponding to the D, 
is carried out in three steps. Let ἢ, = ab” again, but a not necessarily 
square-free. The task is to examine whether ἀρ = a < ἀρ holds. 


(a) We can disprove d;=a in many cases in the following way. If 
ἀρ = a < ἀρ holds, there would exist a generation of F in the form F = Q(p) 
with a smaller value of tr(p*) by Theorem 1, i-e., there would exist a polyno- 
mial f, [(ΡῚ) = 0, with a greater coefficient Z and a|D;. This f would also be 
among the 3984 polynomials. If there is no such f, we can eliminate f. 

(Ὁ) By a method of Nagell [3] we determine the powers p™ (m € N) of 
prime numbers p < 7, p|b, which also divide ἀρ. Essentially we construct an 
integral basis of F with respect to these primes. 

(c) For prime numbers p that are greater than the field degree we apply 
a strong method of Zassenhaus [6]. In Z[x] we compute the polynomials 


Do = gcd(f, 7) mod p, Co =f/Do mod p, 

C,=gcd(f, Co) mod p, D, = gcd(CoDo — f, C,) mod p. 
For ἢ, = 1 we obtain p } (D, /d;) and D, #1 yields p”|(D,/d,). 
By means of these three criteria we finally get 


Theorem II]. The minimum discriminant of seventh degree totally real 
algebraic number fields is 


do = 20 134 393 = 71 - 283 583. 


Remark Actually we start our computations without knowledge of dy. 
Since only the lower bound for a, depends on dy, we carry out the program 
for a, € {0, 1, 2, 3} and the corresponding greatest values for a, from (4). 
Then we proceed replacing a, by a, — 1 until we find a polynomial f the 
roots of which generate a totally real field F, of degree 7 and discriminant 
d,. The discriminant d, yields a lower bound for a, by Theorem I. Every 
time we find a field of smaller discriminant we get a better lower bound for 
a,.In fact dy is already obtained after a few steps. 


It remains to show that there is—up to isomorphism—exactly one field of 


DISCRIMINANT OF SEVENTH DEGREE ALGEBRAIC NUMBER FIELDS 239 


discriminant ἀρ. Namely, we obtain 12 polynomials f with Ὁ, = do: 
fi (x) = x7 + x® — ὀχ — 5x* + 8x? + 5x? — 2x - 1 
fo(x) = x7 + x® — ὀχ — 4x* + 10x? + 4x? — 4x - 1 


fio(x) = x7 + 2x® — 8x? — 18x* + 7x? + 23x? -χ-- 7 


) 
βα)- x7 + x® — 8x5 -- 12x* + 6x3 + 12χ2-- 1 
falx) = x7 + x© — 10x35 — 15x* + 20x* + 45x? + 18x -- 1 
fs(x) = x? + 2x® — 5x3 — 8x* + 5x? + 6x? —-x-—1 
fo(x) = x7 + 2x® — χ᾽ — 8x* + 8x? + 7x? — 3x - 1 
fa(x) = x7 + 2x® — 6x5 — [1χ΄ + 8x3 + 10x? — 4x — 1 
fg(x) = x7 + 2x® — 7x5 — 12x* + 11x° + 10x? — 5x -- 1 
fo(x) = x7 + 2x® — 7x5 — 13x4 + 11x? + 14x? -- 8x — 1 

) 

) 


( 
fig (x) = x7 + 3x® — 4x° — 11x* + 8x? + 7x? — 6x + 1 
ial 


Let x; be a root of ἢ (i= 1,..., 12). Then x, = x; 15 immediate. To prove 
that all 12 fields are isomiOrphicl is rather corn peaieee) Extensive calculations 
by a computer show that they are indeed: 


x) =x!’ + 3x® — 5x> — 17x* — 5x? + 9x? + 6x + 1. 


X_ =x +x? — 6χῇ — 5x2 + 8x? + 4x, -- 2 
Xg=x?-—2 
Yee x, 


Χη τε x8 +x) — 6xt — 4x3 + 8x? +x, -- 2 


Xg=—X}? -- χἕ + 5x} + 4x2 -- 4x, —2 
Xo = -- χ + 6x} — x} — 8x7 + 2x, +2 
Xi2 = -- ᾽χῦ + 6x{ — x? — 7x? + 2x, 


x3 = x3 — 5x3 + 2x, 

X19 = - χϑ + 6x5 — 7x4 -— x74 1 
Χιι = X39 4+-x5 — 5x3 — 5x2 4+2x,42 
χς = Χῇ — 6xg + x3 + 4x2 — 3χρ. 


Theorem IV There is—up to isomorphism—exactly one totally real alge- 
braic number field Fy of degree 7 and discriminant dy, = ἀρ F 9 is generated 
by a root of the polynomial x’? + x® — 6x° — 5x* + 8x? + 5x? — 2x — Ἰ. 


240 M. POHST 


Remark In the meantime I also proved that the galois group of Fo is the 
symmetric group GS, and that the class number of F is 1. For the proofs and 
a system of fundamental units of Fo, see Pohst [5]. 


All computations were carried out on the IBM 370/165 of the Kernfor- 
schungsanlage Julich. 
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On Absolutely Irreducible 
Representations of Orders + 
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Absolutely irreducible representation modules L of orders H over dedekind domains R with quo- 
tient field K are investigated. In particular a set of representatives of the R-classes contained in the 
K-class of L is described and the finiteness of the class number of L is investigated. The lattice of 
submodules of L is studied and the case where this lattice is distributive is characterized. This 
distributivity is necessary and sufficient for the class number of L to be finite in case the class 
number of R is finite and the residue class fields of R are infinite. The behavior of L under finite 
extensions of R and some examples of Z-representations of finite groups are discussed. 


1. Introduction 


Let R be a dedekind domain with quotient field K and let H be an 
R-order. An H-representation module is defined as an H-module that is 
finitely generated and torsion free as an R-module. Two H-representation 
modules are R-equivalent if they are (H-) isomorphic, and they are K- 
equivalent if their tensor products with K are KH-isomorphic. (We write 
KH instead of Καὶ @p H.) Under certain conditions imposed on R the 
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Jordan-Zassenhaus theorem [5, 16] assures that every K-equivalence class 
of H-representation modules splits into finitely many R-equivalence classes. 
In this paper we shall restrict our attention to absolutely irreducible H- 
representation modules. In Section 2 we describe a set of representatives of 
the R-classes lying in a given K-class of such a representation module (see 
Theorem 2.5). We show that a certain local-global principle is valid (see 
Proposition 2.6), which was already established by Maranda [10] in a 
slightly different way. Finally, we give two new proofs for the finiteness of the 
class number, one for general orders and one for group rings RG, where G 15 
a finite group. 

In Section 3 we investigate the lattice of submodules of an absolutely 
irreducible H-representation module L. In particular we characterize the 
case where this lattice is distributive (see Theorem 3.19) and discuss some 
restrictions on the lattice which can be derived from our proofs of the 
finiteness of the class number. Necessary and sufficient conditions are given 
for the class number of L to be finite. 

In Section 4 we study the behavior of absolutely irreducible H- 
representation modules under finite extensions of the dedekind domain R. 

In Section 5 finally some examples of absolutely irreducible Z- 
representations of finite groups are discussed in order to give some applica- 
tions and illustrations of the developed tools. 

The author is greatly indebted to Professor H. Zassenhaus for many 
inspiring discussions and constant encouragement. He hereby wants to 
thank him. 


Let R be a dedekind domain with quotient field K and H an R-order. If 
two H-representation modules M and L are K-equivalent, then there exists a 
submodule N of L that is R-equivalent to M. For let φ: ΚΜ — KL be a 
KH-isomorphism. Because M is finitely generated as R-module there is an 
re R,r#0 with re(M) Ξ 1. But rp: M > rg(M) & L via x > re(x) is cer- 
tainly an H-isomorphism. So one can choose N = re(M). N has the follow- 
ing properties: 


(i) N is a H-submodule of L; 
(1) L/N is an (R-) torsionmodule; or equivalently 
(1) Land N have the same R-rank. 


Definition 2.1 A submodule N of an H-representation module L with 
the properties (i) and (ii) is called a centering of L. The set of all centerings of 
L is denoted by 3(L). (The term “centering” or “centered lattice” is used in 
crystallography.) 
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We have shown above that a set of representatives of the R-classes into 
which the K-class of L splits can be chosen as a subset of 3(L). We ask, 
When are two centerings M and N of L R-equivalent? Let φ: M > N be an 
H-isomorphism. Then @ can be extended to an KH-isomorphism of KM 
onto KN, 1e., a KH-automorphism of KL, because KM = KN = KL. This 
proves the following lemma. 


Lemma 2.2 Two centerings M and N of L are R-equivalent if and only if 
there exists a KH-automorphism φ of KL with p(M) = N. 


We assume now that L is absolutely irreducible. Then by Schur’s lemma 
the KH-automorphisms of KL are just the multiplications with nonzero 
element of K. If R is a principal ideal domain, this remark and the invariant 
factor theorem suffice to prove the next proposition. 


Proposition 2.3 Let R be a principal ideal domain and L an absolutely 
irreducible H-representation module. A system of representatives of the R- 
classes lying in the K-class of L is given by the set R(L) of all centerings of L 
that are not contained in rL for any nonunit r of R. 


Proof Compare Plesken [11] or Plesken and Pohst [12] for special 
cases. The proof follows from the more general Theorem 2.5. QED 


If R is not a principal ideal domain, the most natural approach seems to 
be to work with ideals instead of numbers. 


Definition 2.4 Let L be absolutely irreducible. M, N € 3(L) are called 
ideal equivalent if there is a (fractional) ideal 2 of R with M = YN. 


Hence R-equivalence implies ideal equivalence and ideal equivalence 
implies K-equivalence. It is immediately clear from the definition that the 
products YIM yield a system of representatives for the R-classes in 3(L) 
where YI runs through a system of representatives of the ideal classes in R 
and M through a system of representatives of the ideal equivalence classes in 


3(L). 


Theorem 2.5 Let L be an absolutely irreducible H -representation module. 
A system R(L) of representatives of the ideal equivalence classes in 3(L) is 
given by all those centerings of L that are not contained in XL for all integral 
ideals YU of R unequal to R. If 3 is a set of representatives of the ideal classes of 
the ideals of R, a system of representatives of the R-equivalence classes of the 
RG-modules that are K-equivalent to L is given by {AM|%eE3 and 
Me R(L)}. 

Proof Only the first part of the theorem is left to prove. Let M be a 
centering of L. By the invariant factor theorem for R-modules there exists 
element m,,..., m, in L, fractional ideals %,, ..., &, of R, and integral ideals 
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QW, “609 YW. of R with Wy = 41, 39 ae Yi, such that 


L= wim, Φ hm, (Ὁ “Ὁ Fm, 
and 


M= ἐδ, Wim, Φ ἐγ) Um, OB: Ὁ ἐδ, lM, 


(with the 91, uniquely determined.) Then 21; 'M is in R(L) showing that 
®(L) contains at least one centering of each ideal equivalence class. Now let 
M and N be two ideal equivalent centerings of R(L). Then N = 8M for 
some fractional ideal 8 of R. By using the above decomposition of Land M 
once more, we get M2, = αὶ and 8%, = R because M and N are in R(L). 
Hence 8 = Rand N=M. QED 


We can give a somewhat closer description of the set R(L) if we use 
primary decomposition of the factor modules L/M with M in R(L). 


Proposition 2.6 Every centering M in (L) is the intersection of uniquely 
determined centerings M(B) in R(L) with P-power index in L, where is a 
prime dividing the index of M in L. Conversely, given finitely many elements 
M() in R(L) with L: M(B) a B-power for different prime ideals $B of R, then 
the intersection of the ΜᾺ) is again in R(L). 

Proof Let Με ®(L). The M() are easily obtained from a primary 
decomposition of L/M. Let L= @?_, &,m; and M = @?7_, ; Fm; with 
m; € L, &; fractional ideals of R and 9; integral ideals of R with 91, > W, > 
DU. Let U, = Py -.- PR Then M(P;) = PKL + M for i=1,..., k. 
Conversely, let M(®;) (i = 1, ..., k) be centerings in R(L) of P,-power index, 
where the , are different prime ideals. Let M be the intersection of the 
M(§;). M is in R(L) iff the first invariant factor ideal of M in Lis equal to R. 
But this factor ideal is the product of those of the ΜᾺ), which are equal to 
R because the M(B;) are in R(L). QED 


Let Ry denote the ring of B-regular elements in K. By standard results 
on localization of rings and modules one sees easily that for any centering M 
of L with 8-power index we can view L/M as an Rg-module such that 

L/M = UAL/AM and RgM = Κῳ Μ 
ΚΦΗ ΚφΗ 

for any ideal 91 of R. Using these remarks, Theorem 2.5, and Proposition 2.6, 
one easily derives Maranda’s results [10] that (in our terminology) two 
centerings of L are ideal equivalent if and only if they are in the same genus, 
1.6., their localizations at all prime ideals of R are isomorphic, and that the 
class number of L is the product of the class numbers of R and those of Re L 
for all primes Ἔ of R. It is also clear now that the determination of R(L) can 
be reduced to that of the R(Ry L) where f runs through all primes such that 
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there is a centering between L and BL or equivalently such that L/BL 
becomes reducible as an H-module. Every centering M in R(L) with 
¥-power index in L is given by RgM OL with RgM in R(RgL). By 
Theorem 2.7 the other centerings of (L) are found by taking intersections. 
The problem to find (Rg L) remains. 

Our next result implies that all elements of R(L) lie between L and aL for 
a certain nonzero α in R. This implies the finiteness of R(L) if the nontrivial 
factorings of R are finite. This is the Jordan—Zassenhaus theorem for the case 
that L is absolutely irreducible. If L is an H-representation module, then 
there is a homomorphism of H into End,(L), φι: H > End,(L) with 
f(x)! = xl for all x ε H and! ε L Clearly ¢,(H) is a suborder of End,(L), 
which leaves exactly the same submodules of L invariant as H. Furthermore, 
the R-module End,(L)/¢,(H) is a torsion module. 


Theorem 2.7 Let L be absolutely irreducible and let I be the last invar- 
iant factor ideal of φ,(Η) in End,(L). 


(i) If M is a centering of L with invariant factor ideals B,,..., 8, in L, 
then 8, B; ' divides U. In particular all centerings of R(L) lie between L and 
YL. 

(ii) If B is a prime ideal dividing U, then either there is a centering in 
R(L) with B-power index in L or L/L is irreducible but not absolutely 
irreducible as an H/‘SH-module. (By tensoring with a suitable extension ring 
this last situation can be reduced to the first.) (The prime divisors of 2 are 
sometimes called critical primes.) 


Proof (i) By the invariant factor theorem there are elements m, Ε L, 
fractional ideals %; of R and integral ideals 91, of R for i= 1,...,n such that 
L= Of, &im,, M= Ol, 91, Fm, and U;|W;., for i=l, ..., ἢ -- 1. 
Hence we can identify End,(L) with the ring of matrices {(a,;)| ay € & 19; 
(k, 1= 1, ..., n)}. By identifying End,(M) with the corresponding ring of 
matrices one sees that the last invariant factor of Endg(L) © End,(M) in 
End ,(L) is 8; 18... Because $,(H) is contained in End,p(L) © Enda(M), we 
get B, 18, |W. 

(ii) Let 31. Then those endomorphisms of L/PL induced by H 
cannot be the full R/P-endomorphism ring of L/PL which is isomorphic to 
(R/P)"*". If there are proper H-submodules of L/L, the first assertion 
holds. If not, L/PL is irreducible and the image of H in Endga(L/PL) is 
isomorphic to D*** where D is a skew field containing Καὶ ἢ properly by the 
remark above. QED 


We remark that the last invariant factor of @,(H) in End,(L) depends on 
the R-class of L. If H is the group ring RG of a finite group G such that the 
characteristic of R does not divide |G|, then we can prove by using Schur’s 
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relations [5, 8], that all centerings in R(L) lie between L and (| G|/n)L, where 
n is the R-rank of L. In Brauer [2] and Brauer and Nesbitt [3], Schur’s 
relations have been used in a similar way. 


Theorem 2.8 Let G be a finite group and L an absolutely irreducible 
RG-representation module, where the characteristic of R does not divide |G |. 
If M is a centering of L with invariant factor ideals B,, ..., 8, (n = R-rank of 
L), then |G|/n lies in 8, 8; '. In particular all centerings in R(N) lie between 
N and (|G|/n)N for any N € 3(L). 

Proof As in the proof of Theorem 2.7 let L= (Ὁ, §;m; and 
M = @/., 8; %,m;. Let D be the K-representation of G belonging to KL 
with respect to the basis m,,..., m,. It D(g) = (d;,;(g)) for g € G, then Schur’s 
relations assure that 


ee ere 


n geG 


διὸ jk 


The invariance of the lattices L and M respectively implies d,,(h) € ὅς, &, ° 
and d,(h) ε 8, ὃ, 8, ᾿ξ; 1 for all he G and 1 <s,t <n. By taking ἢ = g, 
s=i,t=j,andh=g~',s=j,t =i we get 


|G|/n Ε 6/0; 5 δ; 15, ἐδ; ΞΞ Bi Br". 
This implies the theorem if we choose i=nandj=1. QED 


Corollary 2.9 If under the assumption of Theorem 2.8 |G|/n is a unit in 
Κι, then R(L) consist only of L. 


Corollary 2.10 Under the assumption of Theorem 2.8 the K-class of L 
splits into finitely many R-classes if and only if the ideal class group of R is 
finite. 


It is clear that Theorem 2.8 imposes stronger restrictions on RG- 
representation modules concerning the finiteness of the class number than 
Theorem 2.7 does for H-representation modules. For instance, Theorem 2.8 
implies that the class number of an irreducible C[x]G-representation module 
is 1,"whereas there are certainly C[x] orders H with modules having an 
infinite class number. For instance, 


He (che k ! <{x"?) 


has L = C[x]?*' as a representation module in the obvious way. The class 
number is infinite; in fact those centerings of R(L) that lie properly between 
L and xL are in one-to-one correspondence with the points of the complex 
projective line. In Corollary 3.9 we shall be able to give necessary and 
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sufficient conditions for an absolutely irreducible H-representation module 
to have a finite ‘R(L). 


3. 


In this section we want to investigate the lattice 3(L) of centerings of L. 
Throughout the whole section we assume R to be a principal ideal domain if 
not otherwise stated. (This is legitimate because of the results of the preced- 
ing section.) We shall first prove a somewhat stronger version of Brauer and 
Nesbitt’s result that the composition factors of M/pM for Με 3(L) are the 
same as of L/pL where p 1s a prime element of R. We need not assume 
irreducibility for this. 


Definition 3.1 Let M, M’, N, N’ be centerings of L with M’ c M and 
NN. 


(i) The pair (M, M’) is projectively neighbored to (N, N’), if either 


(a) McNand M=M ON (Fig. 1) or 
(b) NcCMandN=NaAM. 


N 
<> 
MW 
Fic. | 


(1) (M, M’) and (N, N’) are projectively related if there is a chain of 
pairs of centerings (L,, 14}, (L2, L5), ..., (L,, L,) with (L,, L,) = (M, M’), 
(L,, 1.) = (N, N’) such that (L;, 10) is either projectively neighbored to 
(Li+ 1, Li4,) or there is an « € K with (L,,,, Lj.,) = (αἰ, αἰ). 


It is clear that “being projectively related” is an equivalence relation on 
the pairs (M, Μ΄) of centerings of L with M’ < M.If(M, Μ΄) ἀπά (N, N’)are 
projectively related, then M/M’ and N/N’ are isomorphic. The converse is 
not always true. The first fundamental result is the following theorem gener- 
alizing the Jordan—Holder theorem for our situation. 


Theorem 3.2 Let L be a (not necessarily irreducible) H-representation 
module with R a principal ideal domain, let p be a prime element in R, and let M 
be a centering of L. For any two H-composition series 


L,=L>L,>°:>L=pL and M,=M>M,>-°:->M,=pM 
of L/pL and M/pM respectively, | = k holds and there exists a permutation π of 
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the k indices such that (L;, L;4,) is projectively related to (M qi), M qiy+1) for 
ἷἱ Ξε ἴ,.... Καὶ -- 1. 

Proof Any two composition series of N/pN are projectively related in 
the above sense for any centering N of L, which is easily seen from the 
Schreier—Zassenhaus proof of the Jordan-H6lder theorem. (Note R/pR is a 
field.) It suffices to prove the theorem under the assumption that L/M is a 
simple RG-module. If M 2 pL, the result is trivial. If M > pL, we choose 
L,=M and M,_, = pL. Then one sees immediately that (L, M) is projec- 
tively related to (pL, pM) and (M, pL) to itself (Fig. 2). The theorem 
follows. QED 


As a corollary we get the well-known result first proved by Brauer and 
Nesbitt under the assumption that H is a group ring [4]. 


Corollary 3.2 Under the assumption of Theorem 3.2 the irreducible con- 
stituents of R/pR ®p H-representations belonging to L/pL and M/pM are the 
same. 


This corollary is the basis for the following algorithm developed by 
Plesken [11] to compute (L) for absolutely irreducible L (if R(L) is finite). 
Assuming that A,, ..., A, are the irreducible constituent of L/pL for all 
primes p for which L/pL becomes reducible. (These are only finitely many 
primes by Theorem 2.7.) From any Με 3(L) new centerings are obtained as 
the kernels of all possible epimorphism of M onto the A;. Determining these 
epimorphisms as well as their kernels amounts to solving systems of linear 
equations over the field R/pR with p as above. Starting out with this 
procedure from L one has to test for each new centering N whether it lies in 
(1) (and does not have to compare it with all the previous ones one has 
obtained), 1.6., one has to check whether the first invariant factor of N in L is 
a unit. This procedure eventually yields all centerings of R(L) (if R(L) is 
finite). For details, the reader is referred to Plesken [11] or Plesken and 
Pohst [12]. 

Before we proceed investigating the lattice 3(L) we make two simple 
remarks. 


(1) From the idea of the algorithm it is clear that R(L) consists of L 
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alone iff for all primes p of R the module L/pL stays irreducible. Let R have 
the property that for all prime ideals of R the factor field has the same 
characteristic as R. If H = RG for a finite group G, then |R(L)| = 1 
independent of the characteristic of R. More generally, if an order H’ over 
any dedekind domain Κ' = R with an absolutely irreducible representation 
module FE contains an R-order H with an absolutely irreducible representa- 
tion module L such that H’ = R’ @gH and L = Κ' @gL, then R(L) con- 
sists only of L in case R is a field. 

(2) The second remark refers to the ideas developed by Schur in [13]. 
We assume R is a dedekind domain. One can predict the Steinitz invariants 
of the centerings of L if one knows the Steinitz invariant of L and the indices 
of all centerings in (Rg L) for all those critical primes ‘8 which are not in an 
ideal class of the nth power of some other ideal (n = R-rank of L). If for 
instance the exponent of the ideal class group of R divides the R/f-rank of 
all irreducible constituents of L/L for all critical primes ‘B, then all center- 
ings of L have the same Steinitz invariant. It would be interesting to find a 
Dedekind ring R, a finite group G, and an RG-representation module L such 
that the Steinitz invariants of all centerings of L are not principal, 1.6., the 
matrix representation of KL cannot be transformed into one where all 
matrix entries lie in R. 


We come back to the lattice 3(L) and assume L to be absolutely irredu- 
cible of R-rank n and R to be a principal ideal domain. The lattice is 
modular and by Proposition 2.6 splits into the direct product of the sublat- 
tices 3,(L) of the p-centerings, as we want to call the centerings with p-power 
index in L (pa prime of R). For noncritical primes p, the p-centerings of L are 
given by p*L with α ε Z*°. Further 3(L) is periodic in the sense that 3(L) is 
the disjoint union of all 2UR(L) with 2 running through all integral ideals of 
R. We first want to discuss restrictions imposed on 3(L) by Theorems 2.7 
and 2.8. 


Definition 3.4 The biggest « € Z*° such that p* is the last invariant 
factor of M in N with M, N ε 3,(L) and M ε R(N) is called the critical 
p-exponent of L for any prime p of R. 


We note that Theorem 2.8 yields an upper bound for the critical p- 
exponent in the case H = RG, namely the biggest « with p*|(|G|/n). 
Theorem 2.7, however, does not give such a bound immediately. But using 
Theorem 2.7 and the kind of argument used in its proof, one gets 2« as an 
upper bound for the critical p-exponent, where ρ΄ divides the last invariant 
factor of $,(H) in End,(L) but not p**! (compare Theorem 2.7 for the 
terminology). Hence the critical p-exponent is always well defined. 


Theorem 3.5 Let M, N ε 3,(L) for some prime p of R such that M, N + 
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ΜΈΝ. Then M, N ε ®(M + N). If p* and p® are the last invariant factor of 
M, resp. N, in ΜΞΝ, then a+ B is smaller than or equal to the critical 
p-exponent of L. 

Proof The first statement follows easily if one chooses compatible bases 
for N and M and uses these bases to construct a basis for N + M in an 
obvious way. From the second isomorphism theorem we conclude that α is 
the smallest natural number such that p*N Ξ M, hence ΡΝ € R(M) (Fig. 3). 


N+M 


ΡΝ 
Fic. 3 


Clearly p***(M/p*N) = 0. If p’M © p*N with y < « + 8, this would contra- 
dict the definition of β as the p-exponent of the last invariant factor of 


N+M/N2=M/MQAN =(M/p’ *M)/(M a Ν ρ΄ "Μ) QED 
As a first corollary we get a result first proved by Brauer in [2]. 
Corollary 3.6 Let H be a group ring RG and p be a prime of R. If 


p|(|G|/n) but p? +(|G|/n), then the p-centerings are linearly ordered and 
R(L) A 3,(L) consists of those M € 3(L) with pL ἘΜ < L. 


We want to ask now, How many maximal centerings N; of N € 3(L) can 
exist such that the ΝΙΝ, are all isomorphic to each other? 


Lemma 3.7 Let N € 3(L) and let A be an irreducible constituent of ΠΡ. 
Then Hom,,(N, A) is a vector space over End;,(A), the dimension of which is 
bounded by the multiplicity of A in L/pL. The centerings M of N with 
N/M = @{_, A are in one-to-one correspondence with the k-dimensional sub- 
spaces of Hom,(N, A) (over End,(A)). 

Proof For g ε Hom,(N, A) and a € End,(A), we get 


ao m € Hon,(N, A). 


Since A is irreducible, End,(A) is a skew field. So Hom,(N, A) is a 
End ,(A)-vector space. Now the rest of the statements follows easily. QED 


Corollary 3.8 If under the assumption of Lemma 3.7 the field R/pR is 
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finite of q elements, then the number of the M € 3(N) with N/M = A is of the 
form (q°? — 1)/(q* — 1) with α = ἀϊπι ρα Ἐπά (4) and B < multiplicity of A in 
L/pL. 


Corollary 3.9 Let the fields R/pR be infinite for the critical primes p of R. 
Then (L) is finite if and only if for any two N, Με 3(L) with N < Mand N 
maximal in M there is no other centering Ν' of M with ΜΙΝ = M/N'. 

Proof Follows immediately from Lemma 3.7 and Theorem 2.7. QED 


As an important consequence we remark that there is at most one center- 
ing M of N with N/M = A in case the multiplicity of A in L/pL is one. We 
want to prove that there always exist a centering N of L that does not only 
have the property that there is only one centering M < N with N/M ~ A but 
also has M as the only maximal p-centering. We can even do better than this 
by introducing a new type of constituent of L/pL using the ideas of 
Definition 3.1 and Theorem 3.2. 


Definition 3.10 Let N be a centering of Land letN, = N>N,>-°°:> 
N, = pN be a composition series of N/pN. We call the projective equi- 
valence classes of the pairs (N;, N;4,) the projective composition factors of 
N/pN (i= 1,...,k — 1). The multiplicity of such a composition factor is the 
number of the indexes i (i= 1, ..., Καὶ — 1) with (N;, N;+,) lying in this 
equivalence class. 


It is clear from Theorem 3.2 that the definition of projective composition 
factor does not depend on the chosen centering N. If (N, M) belongs to a 
projective composition factor of L/pL, then N/M is (an ordinary) composi- 
tion factor of L/pL. However, (N,, M,) and (Ν;, M,) may belong to differ- 
ent projective composition factors of L/pL though N, /M, is isomorphic to 
N,/M),. So in general there are more projective composition factors than 
ordinary ones and the multiplicities of the projective ones are smaller. It is 
clear that both concepts are the same if L/pL does not have repeated (ordi- 
nary) composition factors. 


Theorem 3.11 Let (M, M’) belong to a projective composition factor of 
L/pL. Then there is a p-centering N of L such that N has exactly one maximal 
p-centering N'. Further, (N, N’) is projectively equivalent to (M, M’). 

Proof Let N be a p-centering of M with the following properties: (i) 
N « M’; (ii) there is no p-centering N of M contained in N with property {i). 
By Theorem 3.5 each N with (i) is contained in (M). By Theorem 2.7 there 
exists an N with (i) and (ii) (though it might not be unique). N satisfies the 
assertion of the theorem (possibly after multiplication with an element of K 
to make it a p-centering). Take N’ = Μ' © N. If there were a maximal 
p-centering N” of N with N” # N, then N” is a p-centering of M satisfying 
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(i); hence N does not satisfy (ii), which is a contradiction. QED 


We now want to investigate those lattices of centerings that have the 
property that for any projective composition factor and any centering M 
there is at most one centering M’ of M such that (M, M’) belongs to the 
given composition factor. 


Proposition 3.12 The lattice of centerings of L is distributive if and only if 
it satisfies the following condition: 


(*) For any projective composition factor C of L and any centering M of 
L there exists at most one centering Μ' of M with (M, M’) belonging to C. 

Proof This is an immediate corollary of Birkhoff’s theorem [1] that a 
modular lattice is distributive if and only if it does not contain sublattices of 
the form shown in Fig. 4. Another proof follows from Lemma 3.16. QED 


Fic. 4 


Proposition 3.13 Let L satisfy condition (*) of Proposition 3.12. If 
M,N, N’ ε 3(L) with N, N’ SM such that ΜΙΝ and M/N' have the same 
projective composition factors (with the same multiplicities), then N = N’. 

Proof We use induction on the number of composition factors of M/N. 
If this number is one, the theorem follows from (8) immediately. Assume the 
theorem is true if M/N has less than m composition factors (multiplicities 
counted !). If there is a centering M’ of M, unequal to M and containing both 
N and N’ we can apply the induction hypothesis. Let 


M=No>N,>°°:>N,=N 
M=No>N,>°°°>Nya=N 


be composition series of ΜΙΝ and M/N’. Let i be the smallest index such 
that (Nj, Nj.,) belongs to the same projective composition factor as 
(No, N,). Then the composition series of M/N’ can be replaced by 
M=No>Ni{>N2>°°°>Na = N' where Nj = Nj_, ὦ Ν, for j = 1, 2, 
...,Land Nj = Nj forj=i+ 1,...,m. Note that Nj,, = N; © N,. Because 
of N, = N{ the induction hypothesis can be applied. QED 


Proposition 3.14 Under assumption (*) of Proposition 3.12 the projective 
composition factors of L/pL have multiplicity one. 
Proof Lett L=L,>L,>°:->L,=pL be a composition series of 
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L/pL. Let (L;, L;, 1) be projectively equivalent to (L;, L;,,) with i <j. Then 
there is a chain (N,, N‘), (N2, N4),..-, (Ns, Nj) of pairs of centerings such 
that (N,, N4) = (L;, L;41) and (N,, Nj) = (p"L,, p*L;+ 1) for some « € N and 
(N,, N;,) is a projective neighbor of (N,+1, N,+1). But (*) implies that 
(N,,N,+1) Or (Nia1, N;,) (respectively if N, >N,41 or if Ny+1 = Ny) is not 
projectively equivalent to (L;, L;,,). Hence if M is a centering containing 
all the N,, the multiplicity of (L;, L;,,) in ΜΙΝ, is the same for all N;. So 
(L;, L;+1) cannot be projectively equivalent to (L;, L;.,). QED 


J 
Definition 3.15 Assume (*) of Proposition 3.12 and let L = L, > L, > 
--+>D,,, = pL be a fixed composition series of L/pL. Let M,N be p- 
centerings of L with M C N. We denote M by N(a,, ..., «,) where «; (i = 1, 
...,k) are the multiplicities of the projective composition factor (L;, L;,,) in 
N/M. 


Lemma 3.16 Under the assumption (*) of Proposition 3.12 and in the 
terminology of Definition 3.15 the following hold: 


(i) p°N(a4,..., αι) = N(a+a,,...,a+0,) fora ε Z7°; 
(ii) ᾿ς en ἀμ) ΓῚ Νίβι. ὅπ 59 Bx) = Νῴ(γ:: eas γι) with ee max(q;, Bi) 
a ΠΡ 

(iti) N(a,,..., αι) + N(B,, .... B,) = N(d1, ..., 6,) with 6; = min(«;, B;) 
(i= 1,...,k); | 

(iv) Noy, .... ακ)(χ,»..., χι) = N(x, + 4,-2.5 ἄχ +X); 

(v) Νία,, ..., αι) ε R(N) iff at least one αἱ =0;where N(a,, ..., ἀμ), 
N(B,, .-.. By), and N(a,, ..., &)(X1, ..., X,) are p-centerings of N. 

Proof (i) and (iv) are trivial. We prove (ii) and (iii), i.e., y, = max(«;, B;) 
and 6; = min(q;, B;). From the second isomorphism theorem we see im- 
mediately that the first set of equations implies the second. To proof the first 
equations one proceeds similarly to the proof of Proposition 3.13 by using 
induction on δ, αἱ ἘΣ ες B;. QED 


Our next aim is to describe the biggest R-order in End,(KL) that leaves 
all the centerings of L invariant. Of course it is given by (} μεφ Endp(M) 
where we view the End,(M) as orders of Endx(KL). We shall prove that this 
is a graduated order as introduced by Zassenhaus [14] iff 3(L) satisfies (*) of 
Proposition 3.12. 


Definition 3.17 Let R be a local. Dedekind ring with prime element p. A 
graduated order over R is a suborder of Κ΄" containing n orthogonal 
primitive idempotents e,, ..., e, with e, ἘΠ +e, = I, (= unit matrix). 


We give a short summary of the properties of graduated orders [14]. 


Theorem 3.18 Let A be a graduated order contained in R"*" with primi- 
tive idempotents e,, ..., 6, (and R local). A can be transformed in such a way 
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that the e; become the diagonal matrices diag(0, ..., 0, 1, 0,...,0) with the 1 in 
the ith place; and further there are natural numbers n,, ..., n, with 
ny ἘΠ +n Ξε ἢ ἀπά a, Ε Z*° ὦ {co} (i= 1,..., k) such that αἱ; = 0 only if 
i # j and a;; = 0 and 


A = {(p*a;;)|a,; ε Ἀπ πὴ 
where p® stands for zero. The αι; satisfy the obvious conditions 
αι; τ ay 2 ἂς i,j,k =1,..., ἢ 


which are necessary and sufficient for A to be closed under multiplication. 
KA = Κ΄" iff all a,; € ΖΞ Ὁ. In this case the submodules (centerings) of R"™ 
invariant under left multiplication are given by 


p*'ay 
REO Gc ..., Oe) = : with a,;d; >a; aje ΚΠ Ὶ forj=1,...,k}. 
p**a, 


If R is a nonlocal principal ideal domain, then a suborder of Κα " is 
called a graduated order if it becomes a graduated order by localization at 
all primes p of R. Theorem 3.18 can be generalized in an obvious way for this 
situation. From Theorem 3.18 we see that the centerings of an irreducible 
graduated order form a distributive lattice because the rules of Lemma 3.16 
are fulfilled. We also have compatible bases for the centerings of R"*’ in the 
sense of Zassenhaus [17], i.e., one gets a basis of each centering by multi- 
plying the elements of the basis, 


1 0 0 
0 1 0 
0], LObea5h ets 
1]: 0 
0 0 1 


by certain elements of R. We can now characterize the modules satisfying (*) 
in Proposition 3.12. 


Theorem 3.19 Let L be an absolutely irreducible representation module of 
an R-order H where R is a principal ideal domain. The following statements are 
equivalent : 


(i) L satisfies condition (*) in Proposition 3.12. 
(1) The lattice 3(L) of centerings of L is distributive. 
(iii) For all prime elements p of R the multiplicity of the projective compo- 
sition factors of L/pL is equal to one. 
(iv) There exists a system of compatible bases for the centerings of L. 
(v) The maximal suborder of End ,(L) leaving all the centerings of L 
invariant is a graduated R-order of K"*". 
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Proof In Proposition 3.12 we proved already that (i) and (ii) are equiva- 
lent. By Proposition 3.14 statement (i) implies (iii); the converse of this is 
trivial. By Theorem 3.18 the last two statements are equivalent and also by 
Theorem 3.18 one sees that (v) implies (i). We show that (i) implies (iv). In 
Zassenhaus [17] necessary and sufficient conditions for the existence of com- 
patible bases for systems of submodules of a finitely generated abelian group 
are given. The results of Zassenhaus [17] can easily be generalized to finitely 
generated modules over principal ideal domains in an obvious way. To 
prove (iv) we then have to check the following conditions for 3(L) = Σ: 


(a) 3(L) is distributive. 

(b) For any prime p of R, let 2, be the set of all factor modules M/p*L, 
where « is the critical p-exponent of L and Με 3(L) with p*L¢ M. 

(Ὁ.)  p*(my m2) = p*m, σὰ p"m, for all με Z*° and m,, m2 € Σ,. 

(b,) Let me, m' = Dw ες (p’m’) © mand define (p~ ’, X)m to be the 
intersection of all γι" ΕΣ with p’m” =m’. For any v € Z?° the operator 
(p”’, Σ) has to satisfy the (p"', X)(p-’, L)m = (p-* ?, X)m for all me Σ. 


Condition (a) is satisfied because of Proposition 3.12. One easily checks (b,) 
by means of Lemma 3.16. We check (b,). Let m = L(a,, ..., «,)/p"L ΕΣ. If 
v >a we get (p ἡ", &)m = p*L/p"L. If v < a, one sees easily (p~’, L)m = L(B,, 
..-, B,)/p"l with B; = max(v, αἰ) — v. (Ὁ2) now follows easily. QED 


The following application of Theorem 3.19 which is already partly 
proved in Zassenhaus [17] might be of interest. 


Corollary 3.20 Let R be a principal ideal domain, L a finitely generated 
free R-module, and let MN be a finite set of submodules of L of the same R-rank 
as L. Then there exist compatible bases for L and the submodules in M if and 
only if the lattice M is distributive, where M is generated by L and the elements 
of IR by taking intersections, sums, and by multiplying with elements of the 
quotient field of R such that the resulting modules are contained in L. 

Proof Let H = End,(L) ἡ (μι Endr(M), where End,(M) is con- 
sidered to be an order in Endx(KM). Then H is an R-order and L an 
absolutely irreducible H-representation module with Xt < 3(L). That the 
existence of compatible bases implies the distributivity of It was already 
observed in [17]. (This can also be obtained by applying Theorem 3.19.) As 
for the converse we apply a slight generalization of Theorem 3.19. One sees 
easily that the distributivity of IM implies condition (+) of Proposition 3.12 
for Mt. From this one concludes that the results in Propositions 3.13 and 3.14 
and Lemma 3.16 are also valid for IN, as well as the version of Theorem 3.19 
for WM instead of 3(L). If at some place the finiteness of the critical p- 
exponents is used, the arguments are still valid, since these exponents are 
also finite for Wt because of Mt 3(L). Now this version of Theorem 3.19 
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shows that there is a system of compatible bases even for all modules in Mt. 
This finishes the proof. We remark, however, that one can conclude now by 
using Theorem 3.18 that 3(L) and Mare equal. QED 


The next corollary of the proof of Corollary 3.20 can be considered as a 
partial converse of Theorem 2.7. 


Corollary 3.21] Let R be a principal ideal domain on L, a finitely gen- 
erated free R-module. Let Wt be a finitely generated (in the sense of Corollary 
3.20) lattice of submodules of L that have the same R-rank as L. Then the 
maximal suborder H of End,(L) leaving the modules in IM invariant has the 
same R-rank as End,(L) and L is an absolutely irreducible H-representation 
module. (It might happen that Dt < 3(L).) 

The next application of the ideas developed in this chapter can already 
be found in Plesken [11]. 


Theorem 3.22 If L/pL has two projective constituents each with multi- 
plicity one, then there is a centering M of L such that the p-centerings in R(M) 
are linearly ordered. M can be chosen to be any centering of L such that M/pM 
is indecomposable. 


Proof The proof follows from Theorem 3.11 and Lemma 3.16. In the 
notation of Definition 3.15: 


3,(L) ο (Μὴ) = {M(0, 0), M(1, 0), ..., M(6, 0)} 
or {M(0, 1),..., M(, δ), QED 


We finish this section with a complete list of distributive lattices of 
p-centerings of L, where L/pL has three composition factors, and the critical 
p-exponent « of L is less than or equal to three. So we have the combinatorial 
task of finding all graduated orders as described in Theorem 3.18 with ἰ = 3 
and αἱ; + «;; < 3. Note that max;, (α;; + ἀμ) is the critical p-exponent of 1. If 
the critical p-exponent ἃ is one, the centerings are linearly ordered. If « = 2, 
there are four possibilities and for « = 3 seven possibilities: The maximal H 


order leaving the centerings invariant can be transformed to 


{9 5] Q12 413 
— a2 X73 njXn; 
(a1, &23 5 ἅ31» %32) = p** az a22 p*"a23 [ας] ΕἸ) 
a3 a3 
Ρ αι Ρ΄ 632 a33 


with (% 1, %23, %34, %32) equal to (1, 0, 2, eae , 0, 2, 2), (2, 0, 2, 2), and (2, 1, 2, 
1) for α = 2, and (1, 0, 3, 2), (1, 0, 3, 3), (2, 0, 3, 2), (2, 0, 3, 3), (2, 1, 3, 2), 
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(3, 1, 3, 2), and (3, 0, 3, 3) for « = 3. The class numbers are 4, 5, 6, 7, 6, 7, 8, 9, 
8, 11, 10. The lattices are shown in Fig. 5. 


L L L L ι. ι. 
L(O, 0, 1) pL 
L(O,1,1) 
etc. 
L(O, 1,2) pL pL pL pL pL 
pL(O, Ο, 1) 


H(1, 0, 2, 1) H(1, 0, 2, 2) Ι ] 
-“ 
H(2,0, 2,2) H(2,1, 2, 1) H(1,0, 3,2) H(1, 0, 3, 3) 


Fic. 5 


We want to make some comments on the behavior of centerings if one 
replaces the Dedekind domain R by a Dedekind domain Κ΄ containing R 
such that the quotient field Κ' of Κ΄ is a finite field extension of K. Let 
H'’=R' @rH and L=R’' @,L for any H-representation module L. In 
Zassenhaus and Reiner [18] it is proved that L, = Ly iff L; = L,, in case R is 
local. If L is absolutely irreducible, a somewhat more precise description of 
the situation can be given. 


Theorem 4.1 Let L be absolutely irreducible. 


(i) [18] The mapping 3(L)—> 3(L): M—> R'@M=M’' is injective. 
R(L) is mapped into R(L). 

(ii) Let Ὃ be a (critical) prime ideal of R and let the B-centerings of R(L) 
all contain ¥°L. Let PB’ be an unramified prime ideal of Κ' dividing . The 
mapping ®: 34(L) ὦ R(L)—> 3y(L): Μ-- Μ' + $°L is injective. Assume 
that the constituents of L/BL are absolutely irreducible. ® is a bijection if and 
only if R/B = Κ΄ Β΄ or if the lattice 3(L) of B-centerings of L is distributive. 

Proof (i) Let M, Ne 3(L) with M’=N’. Choose re R such that 
rM © N and denote the invariant factor ideals of rM in N by W,,..., ,. 
Then (ΓΜ) = γΜ' € N’ with invariant factor ideals 9, R’,..., U,, R’. Part (i) 
now follows easily. 

(ii) The injectivity of ® follows from the proof of part (i) of this theorem 
and from Proposition 2.6. To prove the last statement we first note that the 
maximal $’-centerings correspond to certain one-dimensional subspaces of 
certain R'/f'-vector spaces of homomorphisms as described in Lemma 3.7. 
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The maximal $-centerings of M correspond to the one-dimensional sub- 
spaces of the corresponding R/B-vector spaces from which the R’/’-spaces 
under consideration can be obtained by tensoring with R’/B’. Now it is easy 
to see that the R’/8’-subspaces are in one-to-one correspondence with the 
R/B-subspaces iff Κ΄ ἢ = R’/P' or if all the spaces are one dimensional, i.e., 
the lattice of ¥-centerings 15 distributive. QED 


Corollary 4.2 If under the assumption of Theorem 4.1 ' is ramified with 
ramification index a, then the number of '-centerings in R(L) is greater than 
or equal to 


a+ (number of B-centerings in R(L) — 1) + 1 
and the critical §B’-exponent of L is greater or equal to 
α΄ (critical %-exponent of L). 


If Με 34(L) © R(L) such that no proper centering of M has this property, 
then M’ is also minimal in 3y(L) © R(L). 

Proof The last statement follows immediately from Theorem 4.1 (ii) 
and the definition of R(L). To prove the first result let M, N € 3,(L) with 
M <N maximal and let Rg = (B8')"U. Then 

ΝΊΜ’ > B'(N'/M’) 3 --- > (ΒΎ(Ν, Μη 
where all inclusions are proper. The inverse images of these modules are 
different $B’-centerings of L. 


The result follows now from Proposition 2.6 and an easy counting 
argument. QED 


Distributivity of lattices of centerings is not always preserved under 
ramified extensions if the projective composition factors are not in one-to- 
one correspondence with the (usual) composition factors. For example, the 
dihedral group of eight elements has an irreducible Z-representation of 
degree 2. The lattice of 2-centerings is linearly ordered and the constituents 
of L/2L are the 1-constituents. (Dg is a 2-group!) By taking this representa- 
tion over the gaussian integers Z[i], the lattice of (1 + i)-centerings becomes 
nondistributive (Fig. 6). By Corollary 3.8 there are three maximal 
(H;)-centerings of M’ + (1+ iL. Hence 3,,4(Z) is not distributive. 


ι΄ 


Μ over Z — Μ΄ (1«-.ἡι. 


δι. 
2b 


Fic. 6 
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We want to consider some examples of absolutely irreducible ZG- 
representations, where G 15 a finite group. 

Let G be a 2-transitive permutation group of Q={1, ..., n}. Let 
I = @®"_, ΖΕ, be the representation module of the permutation representa- 
tion, ie., gE; = E,y) for g € G and i € Q. The factor module L = I/Z }j-; Εἰ 
is an absolutely irreducible representation module by Burnside’s lemma [8]. 
Denote the coset of E; by e;. In case G = S,,, the constituents of the factor 
modules L/pL are well known [6] for all primes p: 


(a) L/pL is irreducible for p {η. 
(Ὁ) L/pL has two constituent for p|n. (One of the constituents is the 
1-constituent.) 


In [9] it is proved that (a) and (b) also hold for G = A, or certain of the 
Mathieu groups. 


Theorem 5.1 Let G be a 2-transitive permutation group such that (a) and 
(Ὁ) hold (for instance G = S,,, A,). Then the set of representatives R(L) con- 
sists of all centerings Ly of L for d\n, where 


L, = {72} a;e; | d divides δ᾿ ἢ-α a. 


The index of L, in L is d. Hence the class number of L is the number of the 
divisor of n. 

Proof Because of (a) there are no p-centerings for p }n. Let G act 
trivially on 2Z/nZ Then L,=kerg with o:L-2Z/nZ via 
yr, ae; ) 7-1 G- Φ is well defined because of δ, e; = 0 and is a ZG- 
homomorphism. For d|n, we get L, as the inverse image of d " (Z/nZ) under 
φ. Hence the invariant factors of L, in L are 

| eee ba 2 


——__ 


n—2 


So L, € R(L). To show that there are no other centerings in (L) we make 
use of (b) and apply Theorem 3.22. We only have to show that L/pL and 
(applying Proposition 2.6) L,/pL, are indecomposable for p|n. So let 
g δῖ, a;e; = 7, αν (mod p), hence 


(1) 5" ay = Ο a(n) = o (mod ΡῚ 


for all g eG. By using the 2-transitivity of G, one gets αι, ΞΞ ἀ ΞΞ "᾿Ξ 
a, (mod p), hence )7_, a,e; = 0 (mod pL), i.e., L/pL is indecomposable. The 
indecomposability of L, /pL,, follows immediately from the observation that 
the modules L and L, are dual (via the quadratic form of L fixed by G). We 
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indicate another proof: L, is generated by e, — 6; (i, j=1,...,n,i# j). Let 
gp: L,— Z/pZ be a ZG-homomorphism, where G acts trivially on Z/pZ. Be- 
cause of the 2-transitivity of G all ὁ; — ὁ ; are mapped on the same element of 
Z/pZ. For odd p this implies immediately » = 0 because p(e; — e;) = 
φίε; — e;). If p = 2, one gets 3p(e; — e;) = ee; — e;) = 0 from (e, — 62) + 
(62 —e3)+ (e3-e,)=0. Hence φῦ and L,/pL, is also 
indecomposable. QED 


For G=S,, Theorem 5.1 was already stated without proof in [7]. A 
similar proof as above can be found in [11] also for G = S,. 

For our next example let G be an irreducible subgroup of GL(n, Z) 
containing all the diagonal matrices of GL(n, Z). It is clear that G is a 
semidirect product of the elementary abelian 2-group ἢ, of all diagonal 
matrices of GL(n, Z) of order 2” and a transive group P,, of permutation 
matrices of degree n. 


Theorem 5.2 Let G = D,,P,, as described above and L = Z"*'. Then all 
centerings of ‘R(L) lie between L and 21, and are in one-to-one correspondence 
with the normal subgroups of G contained in D,, and unequal 1. 

Proof Let e,,..., e, be the standard basis of L and let e;; (i,j = 1,...,n) 
be the matrix units of Z"*" with e,;e, = 5,,e;. We want to show that 26; jare 
contained in the (Z-)enveloping algebra $(G) of G. Let g € P,, with ge; = ὁ ᾿ 
Then 


g — diag(1,..., 1, —1, 1,..., 1)g = 2e,; € o(G). 


j 
Hence the last invariant factor of φ(6) in Z"*" is 2. By Theorem 2.7 all 
centerings of ®(L) lie between L and 21, One easily sees now that D, and 
L/2L are equivalent G-modules where G acts on D,, by conjugation. QED 


Corollary 5.38 If P,, in Theorem 5.2 is cyclic, the centerings of L in R(L) 
are in one-to-one correspondence with the divisors # 1 of (x" — 1) in Z/2Z[x]. 


Corollary 5.4 If P,, in Theorem 5.2 satisfies the conditions (a) and (Ὁ) at 
the beginning of this section, then the class number is three. The centerings of 
L=2Z""' in ®(L) are 1, N={5%,4;e;|2 divides δ a} and 
M = {)".1 a,e;|0; = α; (mod 2) for i,j = 1,..., νὰ 


Note that for even n the 2-centerings of L in (5.4) are linearly ordered: 
L>N2>M 2>2L::-. L/N and M/2L are 1-constituents. This is an example 
that projective constituents and ordinary constituents differ from each other. 

As a further example we mention that it is proved by Zassenhaus [15] 
that for absolutely irreducible representation modules L of metacyclic 
groups the lattice of p-centerings is linearly ordered and the p-centerings of 
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R(L) lie between L and pL. (It is proved that φ, (6) is a hereditary order.) 
As a last example we take the 2-group of 8 x 8 matrices given by 


where A = [7 “27 and Β -- [9 ὁ]. Note that by Theorem 2.8 the only nontri- 
vial centerings of L = Z®*! are 2-centerings and that for p-groups there is 
only one modular irreducible representation. The centerings in this example 
were computed electronically by M. Pohst and myself by using the program 
written for Plesken and Pohst [12]. The lattice of 2-centerings is shown in 
Fig. 7. The class number is 32. But one can easily construct examples of 
2-groups of degree 8 with higher class numbers. 
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The Existence of P-adic Abelian L-functions 


CARY QUEEN+ 


UNIVERSITY OF CALIFORNIA 
BERKELEY, CALIFORNIA 


We first prove that a modular form on T ,(p’), r > 1, is a p-adic modular form. Using this fact, we 
establish the existence of a p-adic L-function for each abelian character of any totally real number 
field. 


Introduction 


It is a well-known but remarkable fact that the values of the Riemann 
zeta function at negative integers are rational. This phenomenon suggested 
to Kubota and Leopoldt the possibility of defining a “p-adic zeta function” 
on Z, by p-adic interpolation between these values [6]. More generally, they 
proved that for every Dirichlet character y, there is a continuous p-adic 
L-function L,(z, 5) on Z, such that L,(y, 1 — n) = L(y, 1 — n) for positive 
integers n = 0 mod(p — 1). The significance of this result was enhanced by 
Iwasawa’s later discovery of the connection between p-adic L-functions and 
cyclotomic extensions [3]. 

It would naturally be of interest to extend the concept of p-adic L- 
functions to any algebraic number field. This might be possible for at least 
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totally real number fields because then Lx(y, 1 — n) is an algebraic number 
[5], not always zero. However, the project has been stymied by the lack of 
congruences for L(y, 1 — n) analogous to those for Lo(1, 1 — n) = —B,,/n. 
Coates and Sinnott did recently treat real quadratic K using an explicit 
formula of Siegel [2]. 

J.-P. Serre created a new approach to the question when he introduced 
the concept of “p-adic modular form” [10]. Siegel had earlier noticed that 
Lx(z, 1 — n) is the constant term of a modular form. Serre’s L,(z, 1 — 5) 
turns out to be the constant term of a p-adic modular form. However, 
because he relied on a theorem that modular forms on Io(p) are p-adic 
modular forms, Serre’s treatment was limited to characters of extensions 
K(C,/K. 

The object of the present paper is to prove the existence of a p-adic 
L-function associated to any abelian character x of any totally real number 
field. The basic tool in our proof is a theorem that any modular form on 
To(p’), r = 1, is a p-adic modular form.t In order to be fully general, every- 
thing must be done in the setting of p-adic modular forms of level N, a 
generalization of Serre’s original idea due to Katz [4]. 

In Section 1 some results of Katz are rapidly redone in the style of Serre 
in order to provide a convenient starting point for the present work. 

In Section 2 the idea of a p-adic modular quasiform is introduced. This 
technical notion is precisely what is needed to prove our basic theorems. The 
proofs in this section are just slight variations of proofs of Serre [9, 10], but 
the results seem genuinely stronger. 

In Section 3 we show directly that certain modular forms on a group 
“T),(p*)” are p-adic modular quasiforms. These forms are then used to show 
inductively that modular forms of level N on I,(p’) are p-adic modular 
forms of level N. 

In Section 4 the previous results are extended to modular forms on I ,(p’) 
having characters with respect to I',(p’). In particular, it is shown that the 
higher the conductor of the character, the more rigidly is the constant term 
determined. 

Finally, we present a proof of the existence of p-adic abelian L-functions 
and discuss their continuity at s = 1. 

I would like to thank J. Coates, N. Katz, J.-P. Serre, and especially my 
advisor A. Ogg for help and encouragement. 


1. Preliminaries 


For any prime p, let 2, denote an algebraic closure of Q,, so Q, is a 
compositum of the field of p-adic numbers Q, and the field of algebraic 


+ We have been informed that N. Katz has also obtained this result. 
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numbers Q. Fix once and for all an extension v, of the p-adic valuation to 
Q,, normalized so v,(p) = 1. If K is an extension of Q in Q, let O, be its ring 
of integers and K its residue field. Should K be finite over Q, write 
K, = KQ, for its closure in Q,. 

Let G be any subgroup of T = SL(2, Z) which contains (6 ἢ), and put 
4 = e?™/N. A modular form f on G is said to be defined over K if the 
q-expansion of f at ico is δίψει a,q" with each a, € K. If in fact fis defined 
over Ox, ie, each a, € Ox, thenf= Yo ἃ, 4" € K[[q]] is called the reduc- 
tion of Κὶ Denote by «ἡ κ(6, k) the vector space over K of (holomorphic) 
modular forms on G of weight k defined over K, and by .4,(G, k) the 
K-subspace of K[[q]] consisting of the reductions of all forms in .@,(G, k) 
defined over Ox. Finally, let M,(G) be the subalgebra δ κε M x(G, k) of 
K[[q]] and .@(G) the corresponding subalgebra δῖ ,(G, k) of K[[q]]. 
For short, 9(G, k) will be abbreviated by .(G, k), M3(G) by (G), and 
so on. We take as our starting point two fundamental theorems. 


Theorem A (Swinnerton-Dyer) If p + 2, 3, then 


MT) ~ FQ, RI/(A — 1) 
where Q and R are indeterminates and A(Q, R) is the polynomial such that 
E,-1 = A(E4, Eg). If p=2 or 3, then MT) = F,[A] with A algebraically 


independent over Ἐ',. 


We write v,() 7-0 4,4") for inf, 9 v,(a,) and {x} for the least integer greater 
than or equal to x. 


Theorem B (Serre, Katz) Let f = δος aq" andf™ = V4 aq" 
be members of M(T(N), Κι) and (T(N), k,) respectively. Suppose r € Q, 
(p, N) = 1, and v,(f™ —f™) = v,(f™) +r. Then: 


(1) ifp#2andr>0,k, =k, mod(p — 1)p""'; 
(2) ifp=2andr>1,k, =k, mod 27) 2, 


The proof of Theorem A is the same, mutatis mutandis, as the proof of 
Theorem 2 in [12], with Q replacing Q and F, replacing F,. 

To prove Theorem B, we may assume that v,(f“) = 0 after multiplying 
f™ and f@ by an appropriate constant. Suppose f“) and f are defined 
over a number field L with maximal unramified subfield K, so Καὶ = [ and 
Ox, is the Witt ring W(K). Let w,,..., w, be an integral basis for L, over Καὶ: 
1.6., a basis for the module ©,, over the ring ©x,. Since (I'(N), k) always 
has a basis of forms defined over Q, there are K,-linear combinations {ἢ of 
modular forms over Q such that f = δὲ νν, 74) and f® =) jw, f?. 
Choose w,¢€ K, such thatW=W,, so for each j, v,(w;—w,)>0 and 
υν( [2 —f%) = [r]. As an easy consequence, if f=), w, f, then 
vf —f™) = [r] for [r] =r and v,(f —f™) > [r] for [r] #r. Since 
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f) —f@ is defined over the unramified field Καὶ » in either case 
υν( [6 —f™) > {r}. The theorem now follows by applying Corollary 4.4.2 of 
[4] to f® ε S(W(K), 1, N, Κι. 

In all that follows, by a limit of a sequence of series f® = δος aq", 
a ε Q,, we shall mean a series f= )'9 a,q" such that lim;.,, a? =a, 
uniformly for all n. Moreover, p will always denote a prime not dividing a 
fixed positive integer N, and K will denote some finite extension of Q. By a 
p-adic modular form f over K of level N, we mean a limit of a sequence 
{ὃ € M,(T(N), Κι). It follows immediately from Theorem B that the k; 
approach a limit k in X = Z, x Z/(p — 1)Z, which is independent of the 
particular sequence of modular forms f“ approaching f and is called the 
weight of f, The set of p-adic modular forms over K of level N and weight Καὶ is 
a K,-vector space @x(I'(N), k) closed under limits. Moreover, the p-adic 
modular forms of level N and weight k over all finite extensions of Q consti- 
tute a vector space over Q,, denoted .@(I(N), k). Any member of 
M(T(N), k) can be called a p-adic modular form of level N, without regard 
to its particular field of definition, and p-adic modular forms of level 1 can 
simply be called p-adic modular forms. 

Two important corollaries may be deduced from Theorem B. 


Corollary C Let f=> 9 α,4"ε M(T(N), k) with k = (t, s)e Z, x 
Z/(p — 1)Z. Then: 


(1) ifs #0, v,(ao) = inf, 1 ας; 
(2) ifs =0, v,(ao) = inf, >1 a, — vp(t) — 1 — 5, ». 


Corollary D Let f® = V9 aq" € (Γ(Ν), k;); i= 1, 2, .... Sup- 
pose that the a tend uniformly toward a, € Q, for n > 1, and the k, ΣΡ ΘΗΝ a 
nonzero limit k in X. Then the af) have a limit ageéQ,, and 


f= ΣΧ α,4" € M x(T(N), k). 


Corollary C follows directly from Theorem B upon approximating f by 
f™ € M(T(N), ἢ), letting f(@ =a) ¢M(I(N), 0), and observing that 
vf? —f) = inf,, , v,(a,). For the proof of Corollary D, first recall that 
the Eisenstein series E, = 1 mod p™ if k =0 mod(p — 1)p"~* (p # 2) and 
E, = 1 mod 2” if k =0 mod 25 2, Since K is a finite extension of Q, K, is 
complete. To show that the af) converge, it therefore suffices to show that 
they form a Cauchy sequence. Since lim,.,, Κι = k = (t, 5) # 0, either s # 0 
and k; # 0 mod(p — 1) for i large enough, or t # 0 and u,(k;) is bounded. By 
Corollary C, the v,(a{)) are thus bounded, and upon multiplying all the f 
by a constant, one may assume that v,(f) > 0 for each i. By assumption, if 
i, and i, are large enough,  inf,,, v,(a“?—a%)>M and 
Κι, = Κι, mod(p — ner for given M. If k;, > Και, say, then Corollary C 
applied to f"?) — f") ε M(T(N), k;,) yields v,(a$? — ab?) > M if 


kiy —ki2 
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Κι, #0 mod(p— 1) and va}? — αὐ} Μ — v,(k;,)-—2 if k,,=0, as 
desired. Finally, fe @x(I'(N), k) by the definition of p-adic modular forms 
over K of level N and weight k. 

Our first two propositions show that no essentially new reduced or 
p-adic modular forms arise when the field of definition is extended. Con- 
sequently, a reduced or p-adic modular form is defined over K precisely 
when its coefficients are all contained in K or K,. 


“Ὁ 


Proposition 1 ἡ (Γ(Ν), k) = υἦὐ (Γ(Ν), Κ) Θο Καὶ 

Proof Let wy, ..., W, be an integral basis for K, over Q,. If 
fe M,(T(N), k), let fe «(ΓΝ ), k) have reduction ἢ Then f = Σὺν Si 
with ἢ, € Ox [[gq]] a Q,-linear combination of members of .@,(I'(N), K). 
Since yn i may be approximated by a member of @(I(N), k), 
f,€ M{T(N), k) and f=); ἦν; fj ε Ml (N), k) @g K 


τ If f= 0 a,g" € M(T(N), k), then fe Ari) k) if and 
only if each a, € K. 

Proof Only the “if? assertion requires proof. By the proposition, 
f=), ας ἦς, with ἢ, ε WM (Γ(Ν) k) linearly independent and c; Ε F,. Since f 
and the ἦ have all ΒΡ ΣΕΙ͂Σ in K, the c; are also contained i in K, say by 
Cramer’s rule, so fe υἤκ(Γ(Ν), k) as desired. 


Proposition2 .@(I'(N), k) = οὔ (Γ(Ν}. Κ) @op Kp =  (Γ(Ν). Κ) ΘΟ Κα 


Proof Itis enough to show that iff = 11π|;., f withf® € @,(I'(N), k;), 
then f is a K,-linear combination of members of 4 (I'(N), k). Let w,,..., w, 
be a basis for K, over Q, and write f=), f;w;,f = > fw,, where the/f, 
and [ἢ have coefficients in Q,. The f? are Q,-linear combinations of modu- 
lar forms over Q, hence elements of 4 «(ΓΝ ), k;). Recall that the norm 
defined by || δ), α;νν;} = Σ᾽; |a,|, is equivalent to the v,-norm on K,. Thus 
>; SPwi> >; fw; forces f to approach ἢ for each j, that is, 
f,€ MV (N), k) and f= Σ; ἤν; € M(I(N), Κ) Ὁ K,. 


Corollary 1 If f=). a,q"€ M(I(N), k), then fe Mx(T(N), k) if 
and only if each a, € K. 

Corollary 2 A p-adic modular form with coefficients in Q, is a p-adic 
modular form in the sense of Serre. 

Proof Corollary 1 is proved in the same manner as the corollary to 
Proposition 2, and Corollary 2 is of course a special case of Corollary 1. 


Consider the linear operators U, V: Q,{[q]] > Q,[[q]] defined by 


> a,g"|U = )a,,q" and Σ᾽ a,qg"|V = > a,q?". 
n=0 n=0 n=0 n=0 
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Proposition 3 If f= δ a,q" € Mx(T(N), k), then 
f\|U,f|V € Mx(E(N), k). 


Proof By Corollary 2.1, it suffices to show that U and V map 
M(T(N), k) into itself. Since U and V commute with limits, this would 
follow if U and V mapped -4@(T(N), k) into @(I(N), k) for k € Z. There is a 
decomposition 

M(T(N) kK)= Ὁ M(T(N), x Κ) 
xe(Z/INZ)* 
where 
a ὃ 
MEW) 18) = [γε MTD Ὁ] Fife Gg = AOS 


d 


Hence, all one needs is f |U, f |V ε M(T(N), k)forfe M(T(N), x, k). Write 
kn = (Ὁ — 1)p"~' and f,, = E,,, f€ M(U(N), χ, k + km), SO Km + 0 and f,, >f 
as m-—>oo. By the theory of Hecke operators, f,,|T, ἤ, Ὁ + 
x(p)p***"—*f,,|V € M(T(N), Καὶ + k,,). Therefore 


f\U=limf,| U=limf,|T, ¢ M(T(N), 1) 


completing the proof for U. In particular, f,,|V = χίρ)ρ᾽ ἢ ΚΤ, — 
tn |U)€ M(T(N), Καὶ -- Κι) and f|V = lim f,,|V e (I(N), k), completing 
the proof for V. 


for ᾿ ἢ ει δ == πιοά N} 


In [10], Serre shows that general Hecke operators 7, and differential 
operators © and R,, as well as U and V, preserve p-adic modular forms of 
level 1. As Serre has pointed out to us, © and R, also preserve p-adic 
modular forms of level N. For it follows from the transformation formula for 
E, that if fis ἃ modular form on any subgroup of I, in particular Γ(Ν), then 
so is Of — (wt f/12)E, f, while E, ε (I, 2). The situation for the T, is more 
complicated, however. 


2. p-adic Modular Quasiforms 


For later purposes, a version of Corollary D will be needed which applies 
to certain sums of p-adic modular forms. As a first step in this direction, we 
define the relevant sums precisely. 


Definition f= >> α, 4" € Q,{[q]] is a p-adic modular quasiform over 
K of level N and weight Κα Z, if f=) 2-1 f, with f, ε Wx(I(N), (k, «)). 
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Again, a p-adic modular quasiform of level 1 may be called a p-adic modular 
quasiform. 

The ring of modular forms .@(I) of course has a Z-grading by weight. 
Similarly, the ring of reduced modular forms .@(I) has a 
Z/(p — 1)Z-grading if p+#2,3, because by Theorem A, .@(I)~ 
F [Q, R]/(A — 1), and E,_, = A(E4, E,) has weight p — 1. To put it another 
way, 


p-1 
MT)= OMT) with @wr)= Σ᾽ MAI, ky. 
a=1 k=a(p-1) 

Proposition 4 [f f= )?=; ἢ is a p-adic modular  quasiform, 
vf) = inf, v,(f.)- 

Proof Observe that since .@(I, k) = 0 if 2 +k, only the f, with a even 
can be nonzero. In particular, when p = 2 or 3, a p-adic modular quasiform 
is simply a p-adic modular form, and the proposition is trivial. Otherwise, let 
π be a uniformizing element of a field of definition K for f. Clearly v,(f) => 
inf, v,(f,). After multiplying f by an appropriate power of π, the proof of the 
reverse inequality amounts to showing that if each v,(f,) Σ 0, and ifv,(f) => 
lv,(z) for a nonnegative integer I, then each v,(f,) => lv,(z). This statement is 
trivially true if / = 0, so to prove it by induction on J, assume it 15 true for 
1 = μι. Since the .@“(I) are linearly independent, one sees after approximat- 
ing each f, by a modular form that v,(f) = (m + 1)v,(z) > 0 implies each 
v,(f,) > 0. Hence π᾿ *f= >, x 'f, is a quasiform with v,(x~ 17) => πιυ, (πὶ 
and v,(x~ ‘f,) >0, so by the inductive hypothesis v,(z~ ‘f,) > mv,(z), i.e., 
υν( 7.) = (m + 1)v,(z) for each a. 


Corollary The representation of a p-adic modular quasiform f as a sum 
Pri fa ee MF, (k, «)), is unique. 

Proof It suffices to show that if0 -- δὲ. f,, then each f, = 0, and this is 
the special case v,(f) = οὐ of the theorem. 


Theorem A assures the validity for .@ o(I°) of Serre’s theory of filtration 
and the ©-operator [9]. In particular, if p # 2, 3, the filtration of ©” *(E,,;) 
is ΡΖ — 1. For the rest of this section, g denotes e?””. 


Proposition 5 ΚΕΝ and 
2|k, p=2, 
then 
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Proof By elementary congruences ¢? — ¢ = W where 
Ψ τ᾿ 34 BoE 4) 


ifp + 2,3 andy = —Aif ae 2 or 3. The equation g? — ᾧ = —Acan clearly 
not be solved in .“(I) = dare proving the proposition if p = 2 or 3. Other- 
I 3 εἰν τον ᾧ = ἜΣ with ᾧ,. ε M*(T). As pa =a mod (p-— 1), 

- ᾧ wpe b,) with 6? — ᾧ, € (I). Since ᾧ has filtration 
eae he Ὁ Ὁ ἴον απ p— 1 and 8 - , -- ᾧ for a= p~ 1 But if 
ᾧ,. 1 has any filtration h, @?_, and g?_, — d,_, have filtration ph, which 
cannot equal p” — 1, a contradiction. 


It is now possible to establish an analogue of Theorem B for p-adic 
modular quasiforms. The proof utilizes the Eisenstein-Serre series 
Efe MI, k) defined in [10], which have the property that 
vp(E%,, 0) — 1) = v,(h) + 1. 


Theorem 6 Suppose f = )P=i fM and f® = Y=} f™ are p-adic 
modular quasiforms of respective weights k, and k, with v,(f" —f™) > 
epee r>0. Then v,(k; —k2.)>r—1lifp#2andv, (Κι —k,)>r—2 

if p= 


ἮΝ Assume that K is a common field of definition for f“ and f@ 
with uniformizer x. If p = 2 or 3, f" and f™ are actually p-adic modular 
forms so, by Theorem B, nothing remains to be done. If p - 2, 3, after 
multiplying f and f by an appropriate power of π, one may assume 
v(f\) = 0, whence each v,(f 0”), υ,(5.2) = 0 by Proposition 4. If ky > ky, 
let h =k, —k, and m=v,(h) + 1, so the assertion is m >r. If this were 
not the case, ie, r>m, then the inequalities v,(f{ —f™)>r and 

v(E%o -- 1) Σ μι applied to the equation fE* 5, —f = (7 —f™) + 
ΠΟΣῚ o) — 1) give first v,(f'E* 5 —f™) Σ μι afi then 


po ἜΣ. 0) —f)= = pf OES 0) — 1) mod z. (1) 


Applying Proposition 4 to the p-adic modular quasiform f E%, ο) —f™ of 
weight Κα yields p-"(f'E* 9 -- (9) = δὰ Gas Ga € MT), v an > 0. The 
equation (1) may now be ἐπ ἐπ 


Σ, 9. Ρ »"ἿΣ FO \UE ho) — 1) mod π. (2) 


Since the reduction of a p-adic modular form obviously equals the reduction 
of some modular form, and since 


eee 
ᾧ-- Ap~™(E&® ὁ — 1) with v,(A) = 0, 
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(2) shows that -- Σ᾿ 9,/>., ἢν is in the quotient field of (I). But by 
Proposition 5 and its proof, φῦ -- ᾧ --Κ ε υὔ(Γ) and d¢.M(L), while 
according to Section 1.2 of [10], (I) is integrally closed. This contradiction 
to the assumption m < r establishes the theorem. 


Corollary! If f= >, f@and f® = >, f© are p-adic modular qua- 
siforms and me N, then v,(f — f) = m implies v,(fY — f2) = mfor each 
at. 


Proof Again, the corollary is obvious if p = 2 or 3, and one may assume 
v(f) = 0 otherwise. If f" and f™ have weights Κι and k,, then Theorem 
6 gives h=k,—k,=Omod p™"', so ΚΕ ἡ —fP =f -—fO= 
0 mod p”™. Since fME* ἡ —f7 =, (SMES, 0, -- (2) is a p-adic modular 
quasiform of weight k,, Proposition 4 now gives fO) —f@ Ξὶ ΘΕ ὦ -- 
f? =0 mod p”™. 


Corollary 2 The limit of a sequence of p-adic modular quasiforms 
f=), Ff over Καὶ of weight Κι is a p-adic modular quasiform f over K of 
weight k = lim k,. 

Proof For each a, the f converge to some f, ε K,[[q]] by Corollary 1 
and the completeness of K,. Since each f, € (I, (k, «)), f=), f, isa 
p-adic modular quasiform of weight k. 


Corollary 3 If f=)» a,q" is a p-adic modular quasiform of weight k, 
then v,(ao) = inf,> 1 v,(a,) — v,(k) — 1 — 52, p- 


Corollary 4 Let f = δου aq" be p-adic modular quasiforms over K of 
weight Κι. i = 1, 2, .... Suppose that the a” tend uniformly toward some a, for 
n> 1, and that the Κι approach a nonzero limit k in Z,. Then the af? have a 
limit ag, and f Ξ δ. a,q" is a p-adic modular quasiform over K of weight k. 

Proof The proofs of Corollaries 3 and 4 are analogous to those of 
Corollaries C and D, with Theorem 6 replacing Theorem B. 


We note that Theorem 6 can be deduced immediately from its Corollary 
1. However, there is no obvious way of proving Corollary 1 directly. It does 
seem necessary to use the same methods Serre used to prove Theorem B 
once again. 


3. Modular Forms on I, ;(p’, N) 


Throughout this section, we fix a positive integer N, let p be a prime not 
dividing N, and write (a, b: c, 4) for the matrix [f °]. For each re N, we 
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define three congruence subgroups of I" by 


To(p’, N) = {(a, b: c, 4) ε T(N)|c = 0 mod p’}, 
Vo,(p’, N) = {(a, b: c, 4) ε T(N)|c =0 mod ρ΄, a=d = 1 mod p}, 
Γι(ν", N) = {(a, b: c, 4) ε Γ(Ν) = 0 mod p’", a=d =1 mod p’}. 


We abbreviate Γρίρ') -- Γοίρ', 1), Tor(p") Ξε Γοιίρ', 1), and ΓΙ (ρ') = 
T,(p’, 1). There are isomorphisms T ,(p’*, N)/To,(p’, N) > Z/pZ, 
(a,b: p’~*c,d)roc, and To(p’, N)/To.(p’, N) > (Z/pZ)”, (a, b: ες, ἀ)ν- ἀ, 
where any integer is considered a member of Z/pZ by the natural map. Since 
Z/pZ and (Z/pZ)* are abelian, there are two incompatible decompositions 
of the space of modular forms on I'y,(p’, N): one according to characters 
from I'9,(p""!, N), and the other according to characters from Fo(p’, N). 
Henceforward, € will always denote a character of Z/pZ, extendible to an 
additive character of Z, and y will denote a character of (Z/pZ)”*, extendible 
to a Dirichlet character of modulus p. Then 


M(To1(p’, N), k) = © MT o3(p", N), ξ, k) (r > 2) 
Ee (Z/pZ)* 


MT oi(v',N)kK= Φ οχ(Γρι(ν', Ν), χι Ὁ (r=). 
x €(Z/pZ)** 
Here M(To1(p’, N), & k) = {fe MToi(p, N), k) st. f κ(α, δ: ρ" ᾽ς, d) = 
ξ(ο) for (a, b: p’*e, d) Ε Γοιίρ" , N)} and M (To 1(P", N), Xs k) πῶ 
{fe M(Toi(p’, N), k) 5.1. [ |,(a, b: ς, 4) = x(d)f for (a, b: c, d) € To(p’, N)}. 
Analogous decompositions of course hold for @(T;(p’, N), k). 

The goal of this section is to prove that a modular form of level N on 
Γοίν") is a p-adic modular form of the same level and weight, Le., 
M(To(p’, N), k) < M(T(N), (k, k)). A reasonable strategy for doing this 
would be first to prove it directly for forms on Γροίρ, N) and certain Eisen- 
stein series on I'y(p’, N) having characters with respect to ')(p""*, N). The 
Eisenstein series could then be used to inductively reduce the theorem from 
T'(p’, N) to Γροίν, N). However, since Γρίρ', N) is not a normal subgroup of 
To(p’*, N), the desired characters do not exist. So instead we apply the 
strategy to show inductively that forms on I'9,(p’, N), which is a normal 
subgroup of I'9,(p’*, N), are p-adic modular quasiforms of level N. The 
decomposition of 4(I,(p’, N), k) with respect to characters from I'o(p’, N) 
may then be employed to prove the desired result. 


Proposition 7 Any modular form on Vo(p, N) of weight k is a p-adic 
modular form of level N and weight (k,k), i.e, Mx(Vo(p, N), k) < 
M (I (N), (k, k)). 
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Proof Choose integers « and y such that ap — yN? = 1, let m = pN, and 
write Wz=(pa,N:my,p) so detW=p. Set A= (p,0:0, 1), 
51 = (1, Nj: 0, 1), T = (pu, N: Ny, 1), 


I°(p, N) = {(a, b: c, 4) ε T(N)|b =0 mod p}. 


Write f | for f |, , and f |U* for δῦ, f|A~* |S, sof |U* = pF | U. 
We claim that if fe W(To(p, N), k), then (f|W+f|U*) ε (IN), k). 
First, f | A~* € “(P°(p, N), k) because 


(1/p, 0:0, 1)(x, py: z, w)(p, 0: 0, 1) = (x, y: pz, w). 


Second, the p + 1 matrices T, 57 (1 <j < p) form a complete set of right 
coset representatives of Γ΄ (ρ, N) in Γ(Ν). Indeed, a trivial calculation shows 
that they lie in different cosets, while the set ['(N)/I'°(p, N) injects into 
I'/T°(p), which has p + 1 elements. Putting these facts together, 


f|W4f|Ut=(f|ANIT + YF [AS εὐ MW). 1) 


as desired. Moreover, another simple calculation shows that W normalizes 
Γρίν, N), so that if fe M(To(p, N), k), then f|W~' © M(Io(p, N), k) and 
(f+ |W-*|U*) © a (P(N), k) 

For a fixed even integer | > 4 with (p — 1)|J, let E= E, and g=E— 
pi?E|W ε M(Co(p, N), k). Then g = 1 mod p and g| W~! =0 mod p!*"?. 
In fact, 


E|W= ΕἸ (να, N: my, ΡῈ) = E|(a, N: Ny, p)(p, 0:0, 1) 
= E|(p, 0:0, 1) = p'?E|V 
since (α, Ν: ΝΥ, Ρ)  ΕἾΙΓ, and similarly E|W~'=p’?E|V. Because 
E = 1 mod p, g = E — p'E|V = 1 mod p, and 
g|W-1=(E— p!?E|W)|Wo! = E|W7! -- pl? E 
= p'*(E|V — ΕἸ =0 mod p'*"”. 

Finally, if fe M(IFo(p, N), k), define f,, = fg?" + fg?"|W~ || for each 
meéN. Since fg?" is on Γρίρ, N), f,, is on I'(N) by the remarks above. To 
prove the theorem, one needs f,,—f as m— oo. Write k,, =k + lp” so 
Sn = fg?" + p*~*m!2( fg?")|W-*|U. fg?" +f because g = 1 mod p, and it re- 
mains to show (p!~**’7) fg?"| W-1|U 0. Indeed, 

vp(p? "fg?" |W" *|U) > v,((p'“”?) fg?" |W *) 
=v,(f |W ἢ) + p™v,(g|W-*) + 1-k,,/2 
>v(f |W ')+ 1—k/2 + p"-> o. 
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From now on, let ὦ denote the unique character of (Z/pZ)* such that 
v,(w(x) — x) >0 for all x ε Z. Clearly ὦ generates the character group 
(Z/pZ)**, that is, w', w?, ..., w?~' is a complete list of characters of 
(Z/pZ)*. We always consider each ωἱ to be a Dirichlet character of modulus 
p, so in particular w?~1(n)=0 if p|n, and the L-series L(s, ρ΄ ἢ) = 
(1 — ρ (5). In the future, when discussing modular forms on Γ or M9, (p’), 
q will denote e?”*. But when these forms are see in conjunction with forms 
for Γ(Ν), they should be viewed as series in αὖ with q = e777". 


Proposition 8 Any modular form onT,(p, N) of character w” and weight 
k is a p-adic modular form of level N and weight (k,k+ 7), ie. 
M (Voi (p, N), ἡ, k) «- M(T(N), (Κ, k + y)). 


Proof First note that the series 


G,(w’) = G,(q, w’) = 141 — k, w/) + Ἐν Σ᾽ (sgn υ)ωϊ(υ)υῖ 14! 


n=1 v|n 


lies in M(Io,(p), w/, k), as shown by its expression as a sum of Eisenstein 
series of level p. It follows directly from Theorem 3 of [1] (see Lemma 13 of 
this paper) that if (p—1)|(k+y) and p#2, then v,(L{1—k, w’)= 
v (Bk, /k) = —1— v,(k). Hence, in that case 


E, (a) = L(1 — κι w)'G,(@') = 1 mod pt) ΤΊ, 


- If p = 2, Mx(Toi(p, N), @, k) = Wx(Volp, N), k) « Mx(V(N), (Κ, k)) = 
M (TV (N), (k,k+7)) by Proposition 7. If p#2, suppose 
fE Mx(To,(p, N), ὦ", k) and let k,, = p"(p — 1) + y. Then 


f= LiMn ὦ fE,,,(@”) 
while 
fE,,,(") € M x, (To1(P- N), w'@’,k +k a Ξ M xic,.,\(Uo(P, N), k + k,n). 


Thus, f is a p-adic modular form of level N and weight k+y= 
lim,,+o0 (k + km) by Proposition 7. That fis in fact defined over K asa p-adic 
modular form of level N follows from Corollary 2.1. 


We now commence the second part of our strategy by considering cer- 
tain Eisenstein series. In all that follows, summations such as δ... and ee 
will be over both negative and positive integers v excluding 0. 


Proposition 9 For any integer k > 2 and any character ¢ of Z/pZ, 


H,(é) = ΠΣ δ E(—n/v)(sgn υ)υδ ᾽4" ε M(Toi(P 2), ξ, Κ), 


n=1 υἹη 
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where a%(p)=A(P) Yna1y nn * and A,(p) = p*(k — 1)!/(—2ni). Also 
%(p) € Q(¢,) and if 


2|k, p= 2, 


then v,(a,(p)) = —1— v,(k). 
Proof The Eisenstein series of level p, 


G,(t, c,d, p)=a,(c,d,p)+ > Σ᾽ (sgn vj” * Cera" 
δὰ eee 

are modular forms on Γ(ρ) [8]. Here τ = e?"7/", €, = e?*"/?, a, (c, d, p) = Oif 
c #0 mod p, and a,(c, d, p) = A,(p) Van * if c=0 mod p. Further- 
more. G,(t, c, d, p)|L = G He, (c, d)L, p) for LET, and G,(q, c, d, p)= 
p*G,(t, c, d, p)|(p, 0:0, 1). Hence, G,(q, c, d, p)|x, y: pz, w)= 
G,(q, cx + dz, dw, p) if i; y: : pz, w)€T(p). In particular, if d= 1 and 
(x, y: pz, w) ET ,(p), then G,(q, c, 1, p)| (x, γ: pz, w) = G,(g, c + z, 1, p). 
Therefore 


Ρ Ρ 
> &(—c)G,(q, ὃ... Ρ) | (x, y: pz, w) = ξ(2) >» (—c)G,(q, c, I, Ρ). 
c=1 =] 
ie, )?., €(—c)G,(q, ς, 1, p) € M(To;(p?), ξ, k). The first assertion of the 
theorem now follows upon observing that in fact 


H2) = Ημᾳ, δ) = Σ ξ(-- ἡσι(, & 1, p). 


To show a,(p) € Q(Z,), let σ be an automorphism of C over Q(é,). Since 
H,(€) € M(0,(p?),k) and (I ,(p?),k) has a _ basis over Q, 
H,(¢)" € “(1 ,(p?), k). Thus H,(€) — H,(&)° is a modular form on I’, (p”) of 
weight k. But since all coefficients of H,(é) except possibly the first lie in 
Ο(ζ,). Ημ(ξ) — H,(€)° = «,(p) — %,(p)’, a modular form on I’ ,(p”) of weight 
0. This is impossible unless «,(p)’ = «,(p). 

It remains to evaluate v,(A,(p) δ᾽, π΄ *) when 


2|k, p =2. 
li p= 2, 


y n-*=2(1 —27*)C(k) 


n=1 (p) 
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and 
υρ(λι(Ρ) Σ π΄ ἢ - vp((2" — 1)B,/k) = --ὶ - v p(k). 
n=1 (p) 
If p # 2, 
oe) οο jeven 
Ynk= YY wk=Ae-YY YL enn 
n=1 (p) eye n=1 1sjsp-1 


jeven 


=(2/(p—1)) Σ Lk’) 
1<j<sp-1 
since w7, w*,...,?~ | is a complete list of characters of (Z/pZ)”* /{+ 1} and k 
is even. Now, on the one hand L(k, w?~')=(1—p ")f(k) so 
v (A, (p)L(k, w?~ ‘) = v,((p* — 1)B,/2k) = —1 — v,(k). On the other hand, if 
j#p-—1, there is a fiction equation Ay(p)Lk, a) = t(w/)L(1 — k, @) 
with t(a’) = 225 w!(x)¢% a Gauss sum. Since 


x= 


G,(@/) = L(1 — k, &’) 


ἘΣ Σ wo(sgn veg" © M(Coi(p), ὧν, k) 


n=1 v|n 
< A(T, k -- ἢ 
by Proposition 8, Corollary C gives v,(L{1 — k, @”’) > 0. Putting these facts 
together, 


jeven 


v(A,(P) Σ n-*) = v (Lk, ων *)A,(P) a Σ A,(p)L(k, c’)) 


n= 1 (p) 1<j<p-1 


= —1-1,(k). 


We shall now show that H,(£) € @(Uo1(p’), k) is a p-adic modular quas- 
iform in a way that is consistent with the decomposition 
M(T91(p*), k) = Ὁ, M(To1(p”), x Κ). 

Proposition 10 Let k > 2, ξε (Z/pZ)* and H,(é) = Y?2! Η](ξ) with 
Hy(E) € M(To1(p?), @, k). Then ΗἸ(ξ) ε MT, (k, k + 7). 

Proof For any fe M(To;(p?), k), the w’-component of f is 
f* = (1/(p — 1)) 8225 Οὐ) | R, where R, ε Γ, 


R, = (x71, 0:0, x) mod p?. 


x 


In particular, 


(p — 1)Hi(é) = So oD εἰ - (—c)G,(q, c, 1, Ρ}} Rx, 


x= 
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or after writing ¢(1) = ἔπ and performing a short calculation: 


(> — 1)HY2) = Alp) Yarnn*§ + FY > a" (x)op-™"/*(sgn._ 0)" 1g" 
n n=1 υἱη x=1 
(3) 
This series will be shown to be a p-adic modular form of weight (k, k + y) by 


expressing it as a sum of the known p-adic modular forms discussed below. 
We have seen that 


G,(w') = L(1 — k, w') + 3 > a'(v)(sgn vot *q 


n=1 v|n 


is a p-adic modular form of weight (k, k + i) by Proposition 8. Note that if k 
and i do not have the same parity, G,(w') is ideatically zero. If R,, is Serre’s 
differential operator, 


G,(o', ωὴ = G,(q, ο', ἢ 


ΕΙΣ G,( q, W ‘)|Roo, J) 
= YY οἰζυ)ωϊ(η)ίδρ vot tg" 
n=1 v|n 


is a p-adic modular form of weight (k,k+i+ 2j). Finally, if 
1 -- (, d < Ρ a 1, 


G,(c, d) aoe G,(q, C, d) 


= (0-177 'Y, Σ αἰοωχώσι(, ὦ' ὦ) 


is a p-adic modular quasiform of weight k. 
First, suppose that y#p—1. Then ) 22} οἰ (χ)ζ παν. 0. if 
v — mn/v = 0 mod p, and 
p-1 
Σ, (x)CQ-mn)* = γυ -- mn/v)t(@’) 


x=1 


if v — mn/v # 0. Applying this fact and the functional equation of L(k, @’) to 
(3) gives 


(p — 1)t(@”)” ᾿Η](ξ) = 1{1 — k, w’) + y Σ᾽ w’(v — mn/v)(sgn v)v*~ 1g". 


n=1 v|n 


This expression can be written as the sum of three series, each of which is a 
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p-adic modular form of weight (k, k + y). Indeed, 


(p — 1)e(a”) Ἢ) = U4, 0" w)+ © Σ Τῷ, ng | 


n=1 v|n 
pin piv 
+ » T(v, nt + Σ Y T(r, nt 
n=1 v|n n=1 v|n 
p|n pin piv 


where T(v, n) = w’(v — mn/v)(sgn v)v*~*. The first term is 
{( -- k, οὐ + δ Σ᾽ w"(v)(sgn v)v*~ "gq" = H,(w’)|U | V, 
p|n v|n 


a p-adic modular form of weight (k, k + y). 
The second term is 


> ele — md/c\Gyc, 4) = "Foe — mdjc)'S. ΠῚ 


c,d=1 c,d=1 i, j=1 
The coefficient of a particular G,(w', w’) in this expression is 


p=1-p-1 


dX Dae — md/c)a@'(c)@(d) 
= Σ Tor — mc?d/c)d'(c)@(c7d) 


I 


p-1 p-1 
Σ we) Σ ὦν — πιά), 
c=1 d=1 
and this is zero unless i + 2] = y mod (p — 1). Hence, the second term is a 
linear combination of those G,(w', w’) with weight (k, k + y). 

The third term is 


—m) y yw (n/v)(sgn v)v*~ *q" 


n=1 vjn 
pin piv 


o(—m)S YY. w(n/v)(sgn vot 1g" 


t=1 n=1_ vfn 
ΡΙπ vp(v)=t 


VE 


wo(—m) pa ΣΟ Σ o'(n)or(v)(sgn oo tq 


t=1 n=1 v|n 


p'*- 9G,(@", w’)| Υ]. 


it 


THE EXISTENCE OF P-ADIC ABELIAN L-FUNCTIONS 279 


Since G,(@’, w’) is a p-adic modular form of weight (k, k — y+ 2y)= 
(k, k + y), so is the third term. 

Now suppose y = p— 1. Then )?=; @(x)fe-™"* = p— 1 or —1 as 
(v — mn/v) = Ὁ or #0 mod p. It follows easily that 


ip — 1ymnte) = - [μὴ] Ὁ St + SY θα ttn 


n=1 v|n 


+ ἢ » » (sgn γάτα + ρ λιρ)ρ"" Σ, π΄" 


n=1 vin n 
p[n v—mn/[v=0 


+ SD. Gen opt tet 


=1 ουπ, piv 
v—mn/v=0 


The first term is zero if k is odd and an ordinary Eisenstein series if k is 
even—certainly a p-adic modular form of weight (k, k). The second term is 


Ἐξ  p-1 7 Poke Poh Peds. 5 xt 
Pd» dL Gc, d) = p(p— 1) ΣΌΣ &(c)@(d)G,(o, o’), 
c=1 d=1 c=1 d=1 i,j=l 
c—md/c=0 c—md/c=0 


in which sum the only G,(w', w’) having nonzero coefficients are such that 
i+ 2] =0=y mod(p — 1). Finally, the third term equals 


ΡΣ, (ν' δ  ἢ)}λι(ρ)ρ  "( - Yd n*+ ¥° Σ΄ (sgn γάτα γ' 
oa n=1 vn 
plv 


= p> p™-YG,(w?~')|Vi e WT, k) = ML, (Κ, Καὶ + y)), 
t=1 


completing the proof. 


For any character € and even integer k > 2, let E,(&) = a,(p)” 1H,(6). 
Then E_,(&) = 1/E,(€) isa a modular form on Γι (ἢ 2) of weight 
—k. Moreover, E_,(€) = )?=; E”,(é) with 


Et) = (p— 1)" ὧν @"(x)E_y(2)|Ry 


a meromorphic modular form on I'),(p*) of character w’ and weight —k. 
Each E”_,(€) has a finite-tailed Laurent expansion in 4, valid in some neigh- 
borhood of ico. 


Proposition 11 Let ἔ be a character of Z/pZ and let k > 2 be an integer 
such that 


2\k p= 2, 
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Then the q-expansion of each E”_,(é) lies in Q{[q]] and is a p-adic modular form 
of weight (—k, —k + y). 

Proof Write E, for E,(é), so E, = 1+ p δ, a,q" with a, € Q(C,) and 
v,(a,) > 0, by Proposition 9. Therefore, E?"~* approaches a limit in Q,[[q]] 
as m— oo, and that limit is E_, = 1/E,. But by Proposition 10, E and thus 
ΕΡ 1 are p-adic modular quasiforms. Corollary 2 of Theorem 6 hence 
implies that E_, is a p-adic modular quasiform of weight —k, iLe., 
E_,=> ΕἸ, with E.,¢ MI, (—k, —k+/y)). On the other hand, we 
already know that E_, =) E”, with E’, a Laurent series representing a 
meromorphic modular form on Ig; (p) of character w’ and weight —k. 

By Proposition 10, E, itself equals δ᾽ Εἰ with Εἰ € Q[[q]] both a modular 
form on I'9,(p”) of character w’ and weight k, and a p-adic modular form of 
weight (k, Καὶ + γ). The fact that ()?—, ΕἸ (δ ἢ E”,) = 1 is equivalent, by 
the corollary to Proposition 4, to a system of p — 1 linear equations in the 
p — 1 “variables” E”2,, namely 


EE, = Og, p-1 (1 < β < p ἘΞ 1). 
yity2=B (p— 1) 
And the fact that (ΣΡ ἢ, EL} )O PW, Ef?) =1 is equivalent, by linear 
independence of forms with different characters, to the same system of linear 
equations in the variables E’?,, namely 


EY'E™, = ὃ, p-1 (l<B<p—1). 
yit+y2=8 (p— 1) 

Since the theorem will be proved if Εὖ, = Εν. it suffices to show that 
the constant term of the series represented by the determinant of the system 
of linear equations is not zero. The determinant is clearly a cyclic determin- 
ant with first row (6 = 1) (ΕΡ  ", EP~?,..., ΕἾ). By Eq. (3), the constant term 
of Ej is A,(p) ¥, @’(n)n-*/(p — 1)o,(p). Hence, up to a nonzero factor, the 
constant term of the determinant is given by a cyclic determinant with first 
row (Σ᾽, ©? ‘(n)n-*, ..., ¥, @'(n)n“*). The value of a cyclic determinant 
with first row (a;, a2, .--5 4m) is [ [71 (δε, 4; 62) [7], or in the present case 


iE Σουρ τος F(Z) +e 


We are now ready to prove the main theorem of this section. 


Theorem 12 Any modular form on T9,(p’, N) of character ὦ and weight 
k is a p-adic modular form of level N and weight (k,k + y), that is, 
M x(To1(p", N), @”, k) « MK(T(N), ((, Καὶ + y))- 

Proof By Corollary 2.1, it is only necessary to show 
ὙΠ (Γοι(ρ", N), οὐ, k) < M(T(N), (k, k + y)). In fact we shall prove by induc- 
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tion on r the equivalent statement S(r): If fe W(To,(p’, N), k) and 
f= > Pz; f? with 575) e WT oi(p", N), ὦ», k), then each 

fre M(T(N), (k, k + 7)). 
For r = 1, this is Proposition 8. 

It follows from a trivial matrix computation that if 
g(q) € M(Tox(p"), ἔ, k), then g(q?) = g(q)|V € M(Co1(p"*"), & Κ); and if 
g(q) € M(To(p"), ὦ", k), then g(q)| V € W(Toi(p"* *), w’, k). Consequently, 
E,(q”, f)e M(T9;(p"* 7), ζ, k) and Εἰ(ᾳν", Eye ΑἹ Έ(Γο (ν΄ 2), w’, k). Si- 
milarly, E_,(q’, €) € S(T o,(p"* 7), &, k) and 

ΕἸ ,(q?", ξ) € L(V o1(p" *”), ὦ», [Ὁ 


where «ὧδ᾽ refers to meromorphic modular forms. And of course E}(q”, €) and 
ΕἾ ((4Ρ΄, ¢) are still p-adic modular forms of respective weights (k, k + y) and 
(—k, -—k+y). 

Another simple observation is needed. Suppose that 
fr € M(Toi(p'*, N), ki), fo € (Poi (p"*?, N), ka) 


fa =fi fo € MVos(p"**, N), ky + k). 


Suppose further that f; = Σ᾽ ἢ with fj € W(To,(p'*', N), ὦ, Κι) a p-adic 
modular form of level N and weight (k,, k, + y). Finally, suppose that if 
fr => f, with 2 € (To: (p’*', N), ὦ", k,), then the q-expansion at ico of 
each f3 is a p-adic modular form of level N and weight (k3, Κ3 + y). Indeed 
(k2, Κα + y). Then we claim that if f; = Σ᾽ f% with 


f3€ MTs (p"**, N), ὦ", ks), 


each f3 is a p-adic modular form of level N and weight (Κ3, Κ3 + y). Indeed, 
p-1 
fs - Σ » ΓΙ 
ὙγΞ1 yity2=y7(p- 1) 
with 
SUSY € A(T, (p'*', N), a’, k3), 


yity2=y(p-1) 


and by the uniqueness of character decomposition, 
73 τ reer et FUSPY. 


From the suppositions, {1122 is a p-adic modular form of level N and weight 
(ky ὁ Κ, ky +k. +1 + 72), sof € M(T(N), (k3, ΚΞ + y)). 
It is now possible to assume S(r) and prove S(r + 1). Since 


M(To1(p’*?, N), k)= ® M(Voi(p"™’, N), ς, k), 
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it is enough to verify S(r + 1) forfe M(To,(p'*', N), &, k). Let t = 4(p — 1), 
fi =(/EAq” ", δ) fo = Ε- (ᾳρ ", €). Then the hypotheses of the preceding 
paragraph hold for f, ε @(To,(p’, N), k + ἢ by S(r) while they hold for ἢ 
by Proposition 11. Hence the desired conclusion holds for f; =f, f, = 
7: f= ΣΡΞῚ f? with f’ εὐ W(To,(p’**, N), ὦ, k) a p-adic modular form of 
level N and weight (k, k + y). 


4. p-adic L-functions 


Using Theorem 12, we can deduce a “constant term theorem” for certain 
forms on Γ ,(ρ', N), which will in turn be used to establish the existence of 
p-adic abelian L-functions. First recall that there is a natural decomposition 
(Z/p'Z)” =~ (Z/pZ)* Φ (Z/p’"*Z), p # 2. Correspondingly, any character χ 
on (Z/p"Z)* may be expressed uniquely as χ = 7; ¥2, where y, is a character 
of (Z/pZ)* and x, is a character of (Z/p’” 'Z). Two lemmas will be needed, 
the first of which is a restatement of known results on generalized Bernoulli 
numbers. 


Lemma 13 Let y= 7,7, be a character of (Z/p'Z), p # 2. Suppose 
ΚΕΝ, x; = @! and (p — 1)|(j + k). Then: 

(1) if X2 = 1, v (By /k) a aa v p(k); 

(2) {χε #1, v,(By /k) = —1/(p — 1)p"-? where p” is the conductor of x. 

Proof If v is an integer not divisible by p, write v = a(v)<v>. If 
¥, Ξε χῳφ = 1, the lemma is a special case of the theorem of Clausen—Von 
Staudt. Otherwise, consider the ideal p = (p, 1 — x(g)g*) in Ο(ζρ--1)»}» 
where g is a primitive root mod p. Then p=(p, 1 — w(g)}yx2(g)o(g) 
<g>“) = (p, 1 — x2(g)<g>*) # (1), since x2(v) is a p’th root of unity and 
v(<v> —1)>0 for all v. If y2=1, Theorem 3 of [1] thus implies 
pBk = (p — 1) mod p, so v,(pBk) = 0 and v,(By /k) = --ἴ -- v,(k). On the 
other hand, if y, #1, the same theorem states (1 — x(1 + p))By/k= 
1 mod p. Since x is of conductor p”, χί! + p) = x2(1 + p) must be a primitive 
(p"~*)th root of unity, so v,(B* /k) = —v,(1 — x(1 + p)) = —1/(p — 1)p™’. 


Lemma 14 For y = wy, a character of (Z/p'Z)™, p # 2, let 
φι(χ) -- do DY xv) “a”. 


n=1 v|n 
If (p — 1)|(k + J), then O,(x) ¢ WT ,(ν', N), χ, k). (Here the summation is 
over positive v.) 


Choose an integer / large enough so that m = (p — 1)p' — k > 4. Since 
v(Em(Z) -- 1)» 0, A(x) M(Ui(v", N), xk) would imply $,(x) = 
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Pi(X)En(Z) € MC o(p, N), (p — 1)p'). Letting 5 = (p — 1)ρ', 
χ(υ)ῦδ 1 = Φ(υγχο(υγῶ(υ}}  'φῦ»" 1 = @(vP*** = ὡ(υ) ἢ 
Hence φΦκ(χ) = ᾧ.(1), and it suffices to show 


Φ. = Φ.(1) -Σ de i 'g" ¢ M(To(p’, N), 5) 

Assume the contrary, i.e., @, ε W(Io(p’, N), 5). By Theorem 12 any mod- 
ular form on I'9(p’, N) may be approximated by forms on Γ(Ν). In particu- 
lar, 6,¢.M(I(N), ἢ) with s =t mod(p — 1) and t > 4, which in turn imp- 
lies ¢,¢.M(I(N), t). If @, has reduction ¢, and G, = —B,/2k + 
Σ Yon U'g", then f= G, — φ, ε M(I(N), t). However, v,(ao(f)) = 
v,(ao(G,)) = —1 — v,(t) while inf,, , v,(a,(f)) > 0. Therefore, 


Vp(4o(f)) < Ἦν υρία,( [}} -- v(t) -- 1, 
contradicting Corollary C. 


Theorem 15A Suppose x is a character of (Z/p'Z)” , p # 2, with conduc- 
tor p™ and y, = @’. Let 


{= Yana € M(T,(p’, N), x, k). 


(1 ) If ( p— 1) } ( (k ," 3 then υρ(αρ) = inf, > 1 υρ(α,). ὶ 
ΠΣ ) If (p = 1) [(Κ + j) and X2 > 1, then υρ(αρ) = inf,> 1 υρ(α,) a 

(3) ¥ (p—1)|(k+j) and x2 #1, then υρ(αρ) = inf,» 1 v,(a,) — 
i/(p — 1)p™~?. 

Proof Suppose (1) is not true, so we may take ag=1 and 
inf,> 1 Up(a,) > 0, ie, f= 1+), ¢,q", x a uniformizer for a field of 
definition of f/ Then f”=1 moda, that is to say, v,(ao(f”)) > 
inf, 1 Up(a,(f”)). Since the values taken by y and y, are respectively 
(p~1)th and p’th roots of unity, y?’=y,, while y? = 1. Hence 
75 € MToi(p’, N), w’, p’k) < M(T(N), (pk, k + ἢ) by Theorem 12. The 
fact that v,(ao(f”)) > inf, , v,(a,(f”)) is therefore a contradiction to Cor- 
ollary C. 

Case (2) follows directly from Theorem 12 and Corollary C. Finally, 
suppose (3) is not true, so we may take ay = 1, inf, , v,(a,) > 1/(p — 1)p™ 
and k > 4, after replacing f by f- Ε0ρ-- 1) if necessary. Let g = L(1 — k, x) f 
SO dao(g) = [1 — k, x) while inf, , v,(a,(g)) > Ὁ by Lemma 13. Therefore, 
ᾧ, = (G,(x) — g)~/2 ε M(T1(p’, N), x, k), which contradicts Lemma 14. 


There is also a decomposition of (Z/p'Z)* for p = 2, namely (Z/2’Z)* - 
(Z/2*Z)* Φ (Z/2’-?Z). And there is a corresponding decomposition 
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X = %1X2 Of any character y of (Z/2"Z)*. For p = 2, w previously repre- 
sented the trivial character of (Z/2Z)*. Henceforward, let w instead repre- 
sent the nontrivial character of (Z/27Z)*, so x, = w' or w?. As an analogue 
of Theorem 15A, we have for p = 2: 


Theorem 15B Suppose y is a character of (Z/2"Z)* with x, = a’. Let 
= )a,q"€ M(F,(2’, N), χ, ky). 
n=0 


(1) If 24 (k Ὁ ἢ. then v4(do) > infys 1 v_(d,)) — p(k) — 2. 

(2) If2|(k +j) and x, = 1, then v,(ao) = inf, 1 v,(a,) — vp(k) — 2. 

(3) If2|(k + Jj) and x, # 1, then v,(ao) = inf, , v κ(α,) - 17pm 3,2” the 
conductor of y. 

Proof If (1) or (2) is wrong, we may assume v,(f — 1) > v,(k) + 2. Then 
75 € M(To(p", N), p’k) satisfies v,(f” — 1) > v,(k) +2 +7, or 


vp(ao(f”)) < mu v,(a,(f”)) — v,(p"k) — 2, 


which contradicts Corollary C. 
To prove (3), we begin by showing v,(ao) > inf, 4 v,(a,) — 1. First 
observe that 


= ¥ Lx) )ῦ 5 1g" 4 M(T (Pp, N), x k). 


n=1 v|n 


Indeed, if the contrary held, 


PAX)” = V1 Yan 1g?" © M(Vo(p’, N), pk), 


which implies 6? ¢ (I(N), t) for some even t > 4. Then f= G,|V" -- 
po” € MU(I(N), 1) and v,(ao(f)) < inf, 1. vp(a,(f)) — v,(t) — 2, a contradic- 
tion to Corollary C. Now if ge (I ,(p’, N), x, k) is such that 
v,(ao) < inf, v,(a,)-— 1, we may take k>4 and v,(g—1)>1. By 
Theorem 3 of [1], gern -- k, x)) = 0, so 


h(x) = (G,(x)/2 — Ll — k, x)g/2)~ εἤ(Γ,(ν’, N), χ, Κ), 


violating the previous observation. 

To show that in fact v,(ao)> inf v,(a,)— 1/p"-°*, suppose 
fe MT, (p’, N), x, k), vp(f— 1) > 1/p"~* with x of conductor p”, m > 3. 
Then v,(f?" * —1)>1 with f?"-° e (I ,(p’, N), x?" ἡ, p™ *k), which is 
impossible because (χρη 2} # 1. 

It should be possible to replace the conclusion of Theorem 15B(1) with 


v,(ao) = inf v,(a,) — 1, completing the analogy with 15A(1). However, we 
have not been able to prove this if y has conductor 8. Theorem 15A has the 
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important consequence that if y has conductor p™ with m > 1 (p # 2), then 
v,(ao) => inf,> 3 v,(a,) — A with A independent of k. This phenomenon is 
related to the fact that modular forms on Io(p’, N) with such a character 
need not be p-adic modular forms of level N. 

The next theorem, an analogue of Corollary D, is the final payoff of our 
theory of forms on Γρο(ρ', N). It provides the necessary power to establish the 
existence of p-adic abelian L-functions. 


Theorem 16 Let 
{ὃ -- Σ᾽ age Mx(T,(v, N), x, kj), i ee ae 

n=0 
If p # 2, suppose each x = wy, with x a fixed character of (Z/p’~*Z). If 
p == 2, suppose each yx = y, a fixed character of (Z/p’Z)” , and write j; = 0 for 
each i. Finally, suppose that the a“ tend uniformly toward a, for n > 1, and the 
(k;, k; + j;) approach a limit in X, nonzero if y= 1 or p= 2. Then the a 
approach a limit ag. 

Proof After replacing each f by fE.,- κε (00, we may assume 
79 € M(T,(p’, N), x k;) for a fixed character y. The a® still converge uni- 
formly for n > 1, and the k; approach a limit in X, nonzero if x = 1 or p = 2. 
The proof is now the same as that of Corollary D, with Theorem 15 replac- 
ing Corollary C. 


We now take as our object of study p-adic abelian L-functions. 
Henceforth, let K be any totally real number field, L any abelian extension of 
K, and G the Galois group of L over K. If y is a character of G, there exists an 
ideal f of K called the conductor of y, such that y induces a character on the 
ray class group R, = I,/P,. Here I, is the set of ideals of K prime to f, and 


P, = {(a) ε|α € K, a >0, Ξ 1 mod ft. 


Conversely, by class field theory, if f is any K-ideal and χ a character of Κι, 
then χ comes from a character of some abelian extension of K. 

If f is the least common multiple of ἢ and (p), then a fortiori χ induces a 
character of Κρ. For s > 1, an abelian L-function is defined by L5(y, s) = 
ae hj x(a)" * where the summation is over all integral ideals and 1 , and N 
denotes the norm from K to Q. L(y, 5), or simply L(y, 5), has an analytic 
continuation to the whole plane, with a possible pole at s = 1, and has a 
functional equation. As observed by Klingen [5], the values of L(y, s) at 
negative integers s are contained in the same finite extension of Q as the 
values of χ. It therefore is meaningful to define a “p-adic abelian L-function” 
from Z, to Q, by p-adic interpolation. In order to be more precise, let 6 be 
the character of R; defined by w(Na) and discussed below. Let d = [Κ(ξ) : K] 
where C=C, if p#2 and (=, if p=2. 
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Theorem 17 Let Καὶ be a totally real number field, L an abelian extension 
of K, and y a character of Gal(L/K). Then there exists a unique continuous 
function L,(y, 5) from Z, to Q,, possibly undefined at s = 1, such that 
L(x, 1 — n) = Lk(y¥0~", 1 —n) for ne N. In particular, at any negative 
integer s = 1 mod d, L,(y, s) = Lx(y, 5) 


Proof Let us first consider the character 0, following [2]. Denoting the 
abelian extension [(ζ) of Καὶ by Lo, define a character, also written 0, of 
Go = Gal(Lo/K) as follows: If 9 € Gp, choose an ideal re J, such that 
p = (t, Lp /K) and let 6(p) = w(Nr). This is well defined because upon res- 
tricting to Ο(ζ), (t, Lo/K)=p=(t', 1.0.) Κ)ὶ implies (Nr, Q(0)/Q) = 
(Nr’, Q(¢)/Q), which implies w(Nr) = w(Nr’). Since this new character of Gy 
clearly induces @ on J, and (p)|f, @ is a ray class character of R. Moreover, 
(x, ΚΟ ΚΥ Ξ 1 so (Nr, Ο(ζ)7Ο} Ξ 1 and θ(ρ} --  ὡ(Ντ} = 1. Thus 
L(x, 1 — n) = {{χθ7",1 — n) for all πῈ Ν gives L,(y, 1 -- π) = L(y, 1 -- π) 
for ἢ = 0 mod d, 1.e., the second assertion of the theorem is a special case of 
the first. Finally, for use below, note that if σ is any embedding of Ly into C 
and y is complex conjugation, then θ(σ΄ 'yo) = —1 because σ΄ 'yo(f) = ζ '. 

Since there is a natural map Gal(L» /K) > Gal(L/K), 7 can be viewed as a 
character on Gp without changing the induced character on R; or the func- 
tion L}(y, 5). First, suppose that x is not real, that is, for some embedding o 
of Ly into C, χίσ *yo) = —1. Whether n is even or odd, the I-factor in the 
functional equation of L(y@~", s) corresponding to o then has a pole at 
1 —n, and L(y@~", 1 — n) is always zero. Hence, if x is not real, L,(y, s) = 0 
satisfies the conditions of the theorem. And whether or not x is real, L,(y, 5) 
is unique because the negative integers are dense in Z,. 

Now suppose that χ is real, so the generalized conductor of x contains no 
infinite primes. Then y((«)) = 1 for α ε K, «=1 mod f; and 70~"((a)) = 
(sgn Na)" for such an « and any neN. If (f)=Z af, f>0, then yo" 
induces a Dirichlet character modulo f, also denoted y@~", in a natural 
fashion. Moreover, by a theorem of Kloosterman-Siegel [10], 


Gk(x0~") = L(x0~", 1 — ἢ 


£2 YT x(alNaj*q" © MPS), x07, tk) 


n=1 x,a 


if n and k have the same parity. Here τ = [K : Q] and the inner summation is 
over pairs (x, a) withae I] ,x ed ‘a,x >0, and Tr x = n. Writing f= p’'N 
with p+N, χθ" induces ἃ character of (Z/p’Z)”", and 
Gk(y0-") © MT (ρ’, N), x07", tk). 

For x real, first suppose p + 2. Given s ε Z,,s # 0, choose a sequence of 
positive integers k; approaching s such that (p — 1)|k;. As i oo, the non- 


THE EXISTENCE OF P-ADIC ABELIAN L-FUNCTIONS 287 


constant terms of the ve ) uniformly approach limits, specifically, 
X χία)Ν (α)" " > ΟΣ χία)γυ(Να) 'ζΝα» ". 
Trx=n Trx=n 


Therefore, by Theorem 16, the constant terms L(y, 1 — k;) also approach a 
limit, call it L,(y, 1 -- 5). Theorem 15 shows L,(x, 1 — 5) is a continuous 
function of s because if v,(k{?) — k{”) is large, so is 


inf vp(a,(Gk'(x) " GE? ()Ειν.- κι). 
51 


Thus it only remains to show L,(x, 1 -- n) = [{χθ΄", 1 -- for nEN. If 
κ,- hee i dae )|k;, then Σ᾽ χία)Ν (ay * - δ᾽ χ(α)θ(α) *< Nay"! = 
Σ χθ" γ᾽ Since N(xOx) = (x) for x ε Z, the ὌΝ induced by 


θ on eer a οἷ. Hence 
Si 7 G(x) > GE(xO "Ειᾳ,-- (@™) Ε MT ,(ν’, N), χ: tk;), 


and applying Theorem 15 to f; as i— oo gives the desired result. 

Finally, suppose x is real and p = 2. Given a nonzero s € Z,, choose 
Κι > 5, the k; necessarily even if s € 2Z, and necessarily odd if s ¢ 2Ζ,. By 
the same reasoning as above, the constant terms of G(x), respectively 
Gi(x8~'), approach a limit if 5 ε 2Z,, respectively s ¢ 2Z,. Moreover, as 
above, if this limit is denoted L,(y,1—s), then L,(y,1—n)= 
L(yo~", 1 — n). L,(y, 1 — n)is continuous on 2Z, — {0} and on Z, — 2Z,,s0 
it is indeed continuous on all of Z, — {0}. 


Theorem 18 If the Dirichlet character induced by y has conductor divi- 
sible by p, then L,(y, s) is defined at 5 = 1 (p # 2). 


Proof If the p-conductor part of the induced character is w/y¥, then 
either j #0 or yf) # 1. Theorem 15A(1) or 15A(3) may then be used to 
establish the convergence of L(y, 1 — Κι), Κι > 0, as above. 


Theorem 18 should remain true under the weaker hypothesis that the 
Dirichlet character induced by y is nontrivial. Indeed, L,(y, 5) is probably 
defined at s = 1 if and only if χ itself is nontrivial, but the proof of this no 
doubt requires more advanced techniques. 
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A Characterization of the Line- 
Hyperplane Design of a Projective Space and 
Some Extremal Theorems for Matroid Designs? 
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Combinatorial characterizations of some incidence structures obtained from projective spaces and 
affine spaces over finite fields are given. Also some extremal theorems for matroid designs are 
proved. 


1. Introduction 


An incidence structure is a triple (X, “3, I), X and # are finite sets, and 
I< xX x 2. Elements of X are called treatments, while elements of & are 
called blocks. If (x, B) € I, we say that x is incident with B, and denote it by 
x € B. We shall follow the usual notations of incidence structures, and often 
consider a block as the set of treatments incident with it. If D and D’ are 
isomorphic incidence structures, we write ἢ ~ δ΄. For any B ε &, we shall 
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denote by |B| the number of treatments incident with it. Incidence struc- 
tures are also called designs; for various definitions connected with in- 
cidence structures, see Dembowski [4]. 

An incidence structure D = (X, &, I) is said to be BIBD (ὑ, b, r, k, A) iff (if 
and only if) it satisfies following conditions: 


(a) [|X| =v, [923] =b. 
Ὁ) |B] =k forall Be &. 


( 
(c) Each treatment is incident with exactly r blocks. 
(4) Every pair of treatments is incident with exactly 1 blocks. 


Parameters (v, b, r, k, A) satisfy the equations 


A(v — 1)=r(k -- 1) (1.1) 
bk = rv (1.2) 
b > v (Fisher’s inequality). (1.3) 


A BIBD (uv, b, r, k, A) with v = b and hence r = k is called an SBIBD 
(v, k, A). For further results on BIBDs, see [4]. 

An incidence structure G = (X, &, I) is called a simple graph iff | B| = 2 
for each Be &, and if B,, B, ε &, B, # B,, B, and B, are distinct treat- 
ment sets. Treatments of a graph are also called vertices, while blocks are 
called edges. Two vertices x and y are said to be adjacent iff there exists an 
edge incident with both x and y. A simple graph G = (X, 93, I) is said to bea 
strongly regular graph G(v, n,, pti, p71) [2], iff it satisfies the following 
conditions: 


eo Es Neat | 

(B) Each vertex is incident with exactly n, vertices. 

(C) Ifx, ye X,x Fy, there are exactly pj, or ρὲ, vertices Ζ such that z 
is adjacent to both x and y according as x and y are adjacent or not adjacent. 


Let X be a set of v elements and G = G(v, n, pj,, Pi;) be a strongly 
regular graph with vertex set X. An incidence structure D = (X, 93, 1) is 
called a PBIBD (v, b, r, k, A,, 42), with association graph G(v, n,, pi 1, P71) iff 
following conditions are satisfied: 

(i) |X| =», |B] -- δ. 
(ii) |B| =k for each Be &. 
(11) Each treatment is incident with exactly r blocks. 


(iv) Any two distinct treatments x and y occur together in exactly 4, or 
A, blocks of D according as x and y are adjacent or not adjacent in G. 


We shall denote the association graph of a PBIBD D by G(D). For 
various examples and results on PBIBDs, see [4]. We shall denote by 
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PG(d, 4), a projective space of dimension d over a finite field GF (q), while by 
AG(d, q) we shall denote a d-dimensional affine space over GF(q). 

For d>m>12>0, define an incidence structure P, ,,(d, 4) as follows. 
Treatments of P, ,,(d, 4) are all [-dimensional subspaces of PG(d, 4). An 
incidence relation is given by containment. A, ,,(d, q) is similarly defined by 
taking AG(d, 4) instead of PG(d, q). When no confusion arises we shall also 
write P, ,, for P;,,,(d, 4) and A, ,, for A,_,,(d, q). The following results are well 
known and can be proved easily by using common properties of projective 
and affine spaces: 


(a) Po m(d, 4) and Ag ,,(d, 4) are BIBDs for all d > m> 0. 


(1.4) 
(Ὁ) Ifd >3, P, 4- κ(4, q) is a PBIBD (v, b, r, k, 4,, A2), where 


(4 -- 1)(q* -- 1) ΒΞ πέρι ee aa 

(q* — 1)γ(ᾳ -- 1) ᾿ ᾷπι- 4--ἰ 
«-- 4΄ = Mat * = 1) λ _f "-Ι λ 4" -Ι1 (1.5) 

q?-1q-1) > "*  q-1?* ἢ 4-| 

G(P, 4-1) is a strongly regular graph (v, n,, pj1, P71) where 
d-1 ὁ t d όό 1 
q 
m= (a+ te( 4), a ee 

Pi1 = (ᾳ + 1). (1.6) 


Any two vertices in G(P, ,_,) are joined by an edge iff they are intersecting 
lines in PG(d, q). 

By “replacing” a desarguesian plane by nondesarguesian plane in 
PG(d, 4) or AG(d, 4) one can construct BIBDs, with the same parameters as 
those of Po,» OF Ao, », but nonisomorphic to Po ,, OF Ao, m, respectively (see, 
for example, Mavron [8]). Dembowski and Wagner [5] proved a characteri- 
zation theorem for Po 4-- (4, 4) for d > 3. In Section 3 we prove the follow- 
ing characterization theorem for P, 4. ,(d, 4) for d > 6. 


Theorem 1 Let d and q be positive integers, d > 6, q > 1. Suppose D, is a 
PBIBD (1, b, r, k, A, Az), where parameters are given by (1.5), and G(D,) has 
parameters (v, ny, Pi1, P?1), given by Eqs. (1.6), then q is a prime power, 


D, ΞΡ, a-1(4, 4) and G(D,) = G(P,, 4-1). 


We define a combinatorial geometry (or matroid) by the “hyperplane 
axioms.” An incidence structure D = (X, &, I), is said to be a combinatorial 
geometry iff it satisfies following conditions: 


(a) Given H,, H, € 93, H, #H,, H, is not a proper subset of H,. 
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(Ὁ) Given H,,H,¢€ 9, H, #H,, and x eX, there exists a block 
H; € 93, 5.1. (such that) H; ὦ H, ὦ {x} CH. 

(c) For each x e X, 0H = {x}, where intersection is taken over all 
blocks H s.t. x € H. 


Treatments of a combinatorial geometry are called points, while blocks 
are called hyperplanes. Combinatorial geometries have been studied in 
detail, and for various results on combinatorial geometry, we refer to Crapo 
and Rota [3]. We shall follow the notation of Crapo and Rota [3]. 

A combinatorial geometry D = (X, &, 1) is called a geometric design iff 
D is also a BIBD. A regular geometric design is a combinatorial geometry in 
which all flats of rank i have the same cardinality m;. It can be easily seen 
that a regular geometric design is indeed a geometric design [12]. Geometric 
designs have been studied by Edmonds, Young, and Murti [12], in particular 
they have given many examples of such designs. We list a few well-known 
examples of regular geometric designs. 


(a) At-— (v,k, A) design [4] with A = 1 is a regular geometric design of 
rank (t + 1) 5.1. 


m=i for O<i<t and m,=k. 
(Ὁ) Po.,, is a regular geometric design of rank m + 1 s.t. 
.4-| 
αὐ τε 


(0) Ao. » is a regular geometric design of rank m + 1 s.t. 


for O<i<m<d. 


i 


m,=q'! for O<i<m<d. 


We note that my = 0, m, = 1; and m, is the size of a line in any regular 
geometric design. Clearly m, > 2. 
In Section 4 we shall prove the following results. 


Theorem 2 If D = (X, &, I) is a regular geometric design of rank n > 4, 
then: 


(4) m;— mj, > (mz — 1)(m;-1 — m;-2) for n >i > 3. 
(b) Equality holds in (a) for any iiffD ~ Po ,(d, q)for some prime power 
4 and d>n+1 or m, = 2 and ἢ is a 3-design. 


Geometric designs of rank 3 are precisely BIBDs with 4 = 1. We also 
study geometric designs of rank 4 in Section 4, and show that all such 
designs are regular and examples (a)-(c) given above are the extreme cases of 
certain inequalities to be satisfied. 
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Theorem 3 If D = (Χ, &, 1) is a regular geometric design of rank n > 4, 
then: 


(a) m,—m, Σ (μι -- 1)4(m;_-, — m,;-,) forn >i>j 2. 

(Ὁ) Equality holds in (a) for any i iff D ~ Po ,(d, 4) for some prime power 
qand d>n-+ 1 or m, =2 and ἢ is a 3-design. 

Theorem 4 Let ἢ = (X, &, I) be a geometric design (v, ὃ, r, k, A) of rank 
4, then: 

(a) v=k mod(A — 1). 

(Ὁ) ἀκ -- 2(λ -- 1) > vu Σ ΔΚ —3(A -- 1) -- ζ(λ — 1), /4k - 

(c) υ- ἀκ -- 2(λ -- 1 iff D is a 3-(v, : 7, 

(4) υ-- ἀκ -- 2(λ -- 1) -- Δ - ἡ /4k—3 iff k =q*+q+1 for some 
power q and ἢ = P, »(n, q) where n is given by v = q"**/(q — 1)ork <7 and 
D is a 3-(v, k, 1)- rdesign. 

(6) 170» λκ —4(A — 1) —4(A — 1), (ἀκ — 3, then v > Ak — (A — 1). /k 
and for k > 16 equality holds iff D = Ao ,(n, q) where k = q? for some power q 
and v = α΄. 


2. Preliminaries 


In this section we shall state various known results on characterizing 
geometries, which we shall be using in the next section. 

Let ἢ be a BIBD (vu, ὃ, r, k, A) with A = 1. It is well known that the dual δ’ 
is a PBIBD (b, v, k, r, 4,, 42) with A, = 1, A, =0 s.t. G(D’) is a strongly 
regular graph with parameters (b, r(k — 1), r— 2+ (k — 1)’, k*). Any two 
vertices B,, B, are joined in G(D’) if and only if |B, © B,| = 1 in ἢ. 

The following result is a particular case of a well-known theorem of Bose 
on partial geometries [2]. 


Proposition 5 If G = (V, E, I) is any strongly regular graph with par- 

ameters (b, r(k — 1), r — 2 + (k — 1)*, k?), where b, r, k are integers s.t. 
r > 3[k(k — 1) + k(k + 1)(k? — 2k + 2)] 

then there exists a unique BIBD D(v, br, k, A) with A = 1,v = r(k — 1) + 15... 
V is the set of blocks of D and G(D') = G. 

Let D = (X, 93, I) be an incidence structure. For x, y ¢ X, x # y, line xy 
is defined by 

xy=()B 

where intersection is taken over all blocks B 5.1. x, γε B. 

The following well-known theorem due to Dembowski and Wagner [5] 


gives a characterization of the incidence structure Po 4. ,(d, 4) obtained 
from PG(d, q). 
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Proposition 6 If D is a symmetric BIBD (v, b, r,k, 2), 2 > 1 5.1. its dual D’ 
is a combinatorial geometry, then D = Po 4- 1(d, 4) for some positive integer d 
and prime power q. 


We note that the dual δ΄ of a BIBD D(v, b, r, k, 4) is a combinatorial 
geometry iff every line of D meets every block of D. 

Let D = (Χ, ¥, I) be an incidence structure. For each block B of ἢ we 
define a new incidence structure ἢ. as follows. Treatments of ἢ are treat- 
ments of D incident with B. Blocks of ἢ are the lines of ἢ contained in B. 
The following results on regular geometric designs of rank 4 are well known. 
The first one is essentially the definition of a projective space. The second is 
due to Buekenhout [1]. 


Proposition 7 If D is a regular geometric design of rank 4 s.t.m, > 3 and 
Ds; is a projective plane for all blocks B of D, then D = Po χ(ά, 4) for some d 
and prime power q. 

Proposition 8 If D is a regular geometric design of rank 4 s.t.m, > 4 and 
D, is an affine plane for all blocks B of ἢ, then D =~ Ag ,(4, 4) for some d and 
prime power 4. 


3. Proof of Theorem 1 


Let d and 4 be positive integers, d > 6. Throughout this section we will 
assume that D, = (X, 93, I) is a PBIBD (ὑ, b, r, k, 4,, 2) with association 
graph G,(v, n,, Pi1, P71), Where these parameters are given by Eqs. (1.5) and 
(1.6). The proof of Theorem 1 is essentially based on the Propositions 5 and 
6. We first prove the following simple lemma. 


Lemma 9 The dual of ἢ, is a BIBD with parameters (b, v, k, r, 4) where 


(4 " — 1)@** -- 1) 
(q — 1)(q* -- 1) 
Proof We have only to prove that any two blocks of ἢ, intersect in A 


treatments. The following two equations are obtained easily by counting the 
occurrences of treatments and pairs of treatments in blocks of D,: 


ae 


Σ᾽ |B, oO B2| = b(r — 1k = or(r — 1) (3.1) 
Bi, BrEB 
and 
: 2 |B γι Β{([8, γι Β4] -- 1) = υπ4,λι4(λ, = 1) 
1: 2¢€ 


Ἔσίυ -- Ἰ -- πι)λείλ, -- 1) (3.2) 
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where the summations in (3.2) and (3.3) are over all pairs of distinct blocks 
B, and B, of D,. Using (3.2) and (3.3) we can derive the following equation 
easily: 
Σ (|B, ἡ Β,] -- AP =0. (3.3) 
Βι, Bo EB 
From (3.3) it follows that |B, © B,| =A for all blocks B, and B, of D,, 
B, + B,. This completes the proof. 


Consider the PBIBD D, = (X, &, I). Let 1 Χ and B ε &. Let t, denote 
the number of treatments /, € X s.t.1, ε Band the vertices | and |, are joined 
in G,. Suppose | € B, then counting the occurrences of treatments of B in the 
remaining blocks of D,, containing | and using Lemma 9 we have 

(Ay — Ut) + (A, — τ) -- Ἰ -- τῇ = (ἡ -- 1) -- 1) 
since 1, — A, = 44 ὁ #0, solving the above equation for t,, we get 
th=(q+ ia") for le B. 
q-—1 
Similarly, if 1 € B, we can obtain the following equation 
Axt, + Az(k = t,) = Ar 


and hence 


ἀ-2 1 ἀ-ι 1 


Thus we have proved 


Lemma 10 For le X and Be &, if t, is as defined above, then t, is 
independent of the choice of B and is given by 
rio a 1 


n= a+ (+ 


| for |EB 


d-1 
ὥ  ΞΞῚ 

d t; =  -.-.- 

an I Pan for l€B 


Lemma 11 Let d>6. There exists a unique BIBD ἢ, = (X,, X, 1.4) 
with parameters 


& ait 1 (q’t! ΗΝ 1)(q" ΝΕ 1) 4, = 1 
4-|ἴ ᾿ (4 -)4--ἢ)ὴ ° q-1 


Proof Since d > 6, the conditions of Proposition 5 are satisfied for the 
graph G, and the result follows from the same. 


q+) s.t. G(D) © G;. 
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From now on we shall assume that d > 6 and D, 1s the design defined by 
the above lemma. We note that blocks of ἢ, are the treatments of D,. We 
define a new incidence structure D, = (X,, &, I) as follows. Treatments of 
D, are the treatments of D,, while the blocks of D, are the blocks of D,. A 
treatment x of D, is incident with the block B iff there exists | € X s.t. x € lin 
D, and le Bin D,. 

We proceed to compute the block size k3 in D,;. Let B be a block of D3. 
For each treatment x € Bin D,, define «, to be the number of treatments ἰ of 
D,s.t.l¢ Bin δι and x € lin D,. We first show that a, = (q7~' — 1)|q—-1 
for each x € Bin D,. Let! be any treatment of D, s.t. 1 € Bin D,. Then using 
Lemma 10 and the definition of D,, etc., we have 


Gs = | 
1 -- -ς- 1 3.4 
Σὰ ὦ (Gar) +4) ven 
where the summation is over all the g + 1 treatments x € / in ἢ). 
Also if for each x € lin D, there is some treatment I’ of D,, I' € Bin D, 5.1. 
x el’ in D,, then again using Lemma 10, etc., we have 


4-1 1 4-2 1 
a. <t,=2 _ τς yay. (3.5) 


η--ἰ 4--ἰ 
Using (3.4) and (3.5) it follows that 


ἀ- 
αν τά{{ 1} +1 forall xel 
η--[ 
ἢ 5 
or there is some x ε 1 in D, 5.1. a, = ae (3.6) 


Suppose there exists x € B in D, 5.1. 
4 --ἰ 
η--[ 


We next show that for all z # x, z € B, a, < q(q* 2 -- 1)/(q -- 1). 
Now if z is any treatment of D,, z ε B, z # x, then using (3.7), if I’ is the 
unique block of D, containing x and z, I’ € Bin D,. Hence using (3.4) for I’ 


we have 
ἀ-- 1 d d—2 
gq =1 qe —1 ge =—1 
ee ey ἘΞ < a 3.8 
<(a+u(* -- =a a( (3.8) 


ἃς = (3.7) 


for allze Bin D3, ΖΞ Χ. 
Now since the block size in D, is (q4 — 1)(q4~' — 1)/(q? -- 1)(q -- 1), 
there is at least one treatment /' of D, 5.1. [’é B in ἢ, but x ἐΠ}΄ in ἢ. 
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Equation (3.6) for [Γ΄ and Eq. (3.8) give a contradiction. Hence 
ἀ-- 2 d-1 
ge *—1 g *—!1 
= q|-——__] + 1 = ——_ 3.9 
: { =e q-1 se 


for all x ε Bin D,. 
Now using (3.9), by counting the number of elements in the set 
{(x, l)|x € lin D,, 1 € B in D,} we have 


Pais Gene ta) 
| ἡ τ )ο- Gane tt? 


Hence 


q° 1 η Ι 
= + = 3.10 


for all blocks B of D;. We can now prove the following lemma. 
Lemma 12 Let d > 6. Then D, is an SBIBD 
giti-1 gi -1 ge "-1 
q-1 ’q-1’ q-1] 


Proof It is obvious that the number of treatments or the number of 
blocks in D, is 


4:1: 14 


are. 
i ι- 
Using (3.10) it follows that each block of D, is of size k;, where 


v3 = 


ἢ 131 
Κι Ξξ ei 
Hence we have only to prove that any two treatments of ἢ 5 occur in exactly 
(q?~* — 1)/(q — 1) blocks. Suppose x, and x, are two treatments of D3. 
Then there is a unique treatment / of D, s.t.x,, x, € lin D,. Now | occurs in 
exactly r = (q*-' — 1)/(q — 1) blocks of D,. Hence x,, x, occur in at least 
(q*- * — 1)/(q — 1) blocks of ὃς. Counting the elements of the set 


{(y, z, B)|y,z€X2,y,zE Bin D;, Be 33, 


we easily derive that in D, any two distinct treatments are incident with 
exactly (q* — 1)/(q — 1) blocks. This completes the proof. 


We can now complete the proof of Theorem 1. We first note that the 
blocks of the BIBD D, correspond to the lines of ἢ. For | € X, let (1) denote 
the elements of Χ; incident with / in D,. It is easily proved that for x, y ε (I), 
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(1) is the line generated by x and y in ἢ. For the sake of brevity, | will also 
denote the line of D,. Using Lemma 10 given any block B of D; anda line! 
of D3, there exists at least one treatment m of D, s.t.m € Band / and mare 
incident with a common block of D,. Therefore any line and any block of ἢ: 
intersect. Thus the conditions of Proposition 6 are satisfied. Hence 
D, ~ Po, a-1(d, 4). It follows that ὃ, ~ P, 4-- (4, 4) and G, ~ G(P, 4-,) 
and q is a prime power. 


4. Geometric Designs 


In this section we shall study regular geometric designs. We shall also 
study geometric designs of rank 4. We shall first prove Theorem 2. 

Let ἢ = (X, &, ἢ be a regular geometric design of rank d > 3. Let 
d>j>3.Let PX be any (j — 3)-flat of ἢ. And Q > P bea /j-flat of ἢ. We 
define an incidence structure Dp g as follows. Treatments of Dp g are all 
(j — 2)-flats that contain P and are contained in Q. Blocks are all 
(j — 1)-flats that contain P and are contained in Q. Incidence is given by 
containment. The following result can be proved easily. In fact the usual 
proof for projective spaces can be extended to regular geometric designs 


[12]. 
Proposition 13 Dp 9 is a BIBD with parameters 


| mj -- Mj-3 (mj — mj—3)(mj — mj-2) Wy ee, ed, 8 ἢ 
mj—2 — mj-3° (mj; — mj—3)(mj-1 — mj—2) Mj-1 — Mj-2° Mj-2 — Mj-3° 


Proof of Theorem2 Let ἢ = (Χ, 93, I) bea regular geometric design as 
defined above. Using (1.2) and (1.3) for Dp, g we have 


m;— M;-2 Re (rat LL 5.8 
Mj;-1 — Mj-2 mMj-2 — Mj-3 

Hence 
m;—™mM;-1 m;-1 — Mj;-2 


for all d>j >3. (4.1) 


mM;-1 τ M;-2 Mm;-2 — M;-3 


Using (4.1) it follows that 


m; — m;- 
ee i ΕΣ for all i>3 

ΠΊ;- 1 τ Mj-2 
hence m; — m;_, > (mz — 1)(m;_, — m;_2) for d >i > 3. This proves state- 
ment (a). Now suppose 


m; — m;_, = (mz — 1)(m;_- , — m;_ 2) for some i. 
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Then we will have equality in (4.1) for all j, i >j => 3. In particular 


13 —M),>= (m, = 1). 
Hence | 
m; = (m, — 1)? + (μι — 1)+ 1. 


Thus if Q is a rank 3 flat of ἢ and 0 denotes the O-flat, then Dg ο will be a 
projective plane for all Q. Now let D, be an incidence structure defined as 
follows. Treatments of D, are the treatments of D. Blocks of D, are all the 
3-flats of D. It is obvious that D, is a regular geometric design of rank 4. If 
m, > 3, then D, satisfies the conditions of Proposition 7 and hence 
D, = Po 2(d, 4) for some d and prime power q. Now it follows easily that 
D~ P, ,(d, q) and d >n + 1. If m, = 2, then any three points determine a 
unique 3-flat of D, and hence clearly they are incident with exactly 


(m,, — m3)(m, — m4) “"" (m, — m,— 2) 
(m,— 1 " m3) os (m,—1 " mM, — 2) 


blocks of D. Thus D 1s a 3-design. This completes the proof of the theorem. 
Theorem 3 is the obvious inductive extension of Theorem 2. 

For the rest of this section we shall assume that ἢ = (Χ, &, 1) is a 
geometric design of rank 4. Let the parameters of D be (v, b, r, k, 2). Since 
rank ὃ) Ξ: 4, 1" 1. 


Lemma 14 All geometric design of rank 4 are regular. 

Proof Let ἢ bea geometric design of rank 4, with parameters (νυ, b, r, k, 
A). Let x, ye X,x # y. Let | be the unique 2-flat containing x, y, 1.6., line xy. 
Now from the properties of combinatorial geometries it follows that given 
zé X,z ¢l, there is a unique block B of Ds.t. 1 ὦ {z} © B. Hence counting 
the number of elements in the set {(z, B)|z ε X — l,.z © B, 1 S B}, we have 


Mk — |1])=v— III. 


Hence 


Thus the cardinality of a line | is independent of the choice of the line /. Proof 
of Lemma 14 is now complete. 

We now prove the final result of this paper. 

Proof of Theorem 4 Using Lemma 14, D is a regular geometric design 
with 
_ Ak —ov 
Al 


mM 


and m3 = Κ. (4.2) 
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Since m, is an integer, v = k mod(A — 1). Also since m, > 2, 
Ak — v > 2(A — 1). 
Hence | 
υ <Ak — 2(A — 1). 
Suppose v = Ak — 2(λ — 1). Then m, = 2 and clearly D is a 3-(v, k, 1)-design. 


Thus we have proved (c). We now prove the remaining parts of Theorem 4. 
Using Theorem 2(a) for i = 3, we have 


k > m,(m, ae 1) + 1. (4.3) 
Hence using (4.1) we have 


vp? — ν(λκ — (A — 1)) + A2k? — (A — 1)?k <0. 


Hence 
o> ak “5 > /k- 0 ))7 + 4(A7k? — (A -- 1)2Κ) 
1.6. 
A-1 iA-1 


This proves (b). Now suppose v = Ak — (A — 1) — ξ(λ — 1). /4k — 3; then 
we will have equality in (4.3) and the statement a) follows from Theorem 
2(b). 

Now suppose v > Ak — 4(4 — 1) — 4(A — 1), /4k — 3. Hence 


Thus if P is any 3-flat of D and 0 denotes the 0-flat, then Dp 9 is not an 
SBIBD. Parameters of Dp 9 are 


m3(m;—1) m3 -—1 
eo ΞΡ 94 1 . 
(ms. m (m2 — 1)’ yao 


Hence 


ms" >m +1 (4.4) 


my, — 
i.e., k > m3. Hence using (4.2) we have 


vp? — 2λκυ — (A — 1)?k + A2k? <0. 
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Hence 


1.€., 


v > Ak — 4, /407k? — 4[(A — 17k + 17k?) 


vp > Ak — (λ -- 1)/k. 


If v = Ak -- (A — 1),/k and k > 16, then we will have equality in (4.4) and 
m, > 4. Hence all designs Dp 9 are affine planes, and the statement (e) 
follows from Proposition 8. This completes the proof of the theorem. 


A natural question arises. Are there geometric designs of rank 4 different 


from those given by cases (c)}-(e) of Theorem 4. We note that using examples 
of Steiner triple systems due to Hall [6], one can construct one such geome- 
tric designs of rank 4 (see also Teirlinck [11]). This example also shows that 
the hypothesis k > 16 in statement (6) is necessary. An interesting problem 
will be to classify all geometric designs of rank 4. 
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On the Behavior of Ideal Classes in 
Cyclic Unramified Extensions of Prime 
Degree 


ROSS SCHIPPERt 


DEPARTMENT OF DEFENSE 
WASHINGTON, D.C. 


If K/F is a cyclic unramified extension of number fields of prime degree p and ‘¥ ,_, x: C\{F) > 
Cl(K) is the ideal class mapping, then #(Ker( ;..x)) is bounded above by p’*”~' where r is the 
rank of E, as a Z-module, w = 1 if F contains the pth roots of unity, and w = 0 otherwise. Noticing 
that this bound goes to infinity along with [K : Q], we ask the question, Can one construct a 
sequence of extensions K/F for which #(Ker(;..x)) goes to infinity along with [K : Q]? This 
question is settled by the following theorem which we prove here: 


Let L be any number field and p a fixed rational prime. Then there exists a cyclic extension F of 
degree p over L which in turn possesses a cyclic unramified extension K of degree p over F 
satisfying p’ divides #(H'(Gal(K/F), Ex)) where r is the rank of E, as a Z-module. 


1. Introduction 


Let F be a number field and K a finite Galois extension of F. The 
extension K/F is said to be unramified if no prime of F (finite or infinite) 
ramifies in K/F. If 91 is a nonprincipal ideal of F, then we say that 9 
capitulates in Καὶ ΓΕ if the K-ideal 21, is principal. There is a natural homo- 
morphism Ψ. κ' Cl{F) > Cl(K) given by Cl(Q)r> Cl(QO,x), where Cl(F) 
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denotes the ideal class group of F and Cl(2I) denotes the ideal class to which 
W belongs. Thus YF capitulates in K/F if and only if Cl() belongs to 
ΚΟΥ .. x). When this occurs, we also say that Cl(2U) capitulates. As long as 
K and F are understood, we shall write Ψ instead of Pex. 

There are two basic tools for analyzing Ker(). The first concerns a 
group theoretic homomorphism called the transfer. The second, and the one 
we Shall use, is the following theorem due to Iwasawa [2]: 


Theorem 1 Let ΚΕ be an unramified Galois extension of number fields 
with Galois group G, and let Ex denote the group of units of K. Then 
Ker(¥) ~ H'(G, Ex). 


Now let K/F be a cyclic unramified extension of prime degree p. There is 
a natural lower bound on #(Ker(W)), namely for such an extension, p 
divides # (Ker(‘P)). This result is known as Hilbert’s theorem 94 and follows 
from Theorem 1 and the well-known fact that 


# (H°(G, Ex))/#(H'(G, Ex))=[K: ΕἸ! 


(see for instance Lang [3, p. 192]). There is also a natural upper bound on 
# (Ker()). 


Lemma 2 #(Ker(V)) divides p’*”*' where r is the rank of Ey as a 
Z-module, w = 1 if F contains the pth roots of 1, and w = 0 otherwise. 

Proof By Theorem 1, Ker(¥) ~ H'(G, Ex) where G = Gal(K/F). Also 
by the Herbrand quotient, #(H'(G, Ex))=p #(H°(G, Ex)). Now 
H°(G, Ex) = Ep/Nxp(Ex) and ER S Nx/-(Ex). Moreover, as a Z-module, 
Ery=~ W@®Z' where W is the cyclic group of roots of unity of F. Thus E;/E? 
has order p"*” where w = 1 if p divides #(W) and w = 0 otherwise. Thus 
#(H°(G, Ex)) divides p’*” so that #(Ker(V)) divides p’*”*!. Jj 


Now r is equal tor, +r, — 1 where r, is the number of real embeddings 
of F into C and r, is the number of conjugate pairs of complex embeddings 
of F into C. Thus we see that the upper bound of Lemma 2 goes to infinity 
along with [F:Q]. If N:Cl(K)—CI(F) denotes the homomorphism 
induced by the norm of ideals, then 


N(P(C1(Q0))) = C1)? 


so that Ker(‘¥) is an elementary abelian p-group. Let t be its dimension as a 
vector space over Z/pZ. We consider the following question. Can we find 
cyclic unramified extensions K/F of prime degree p for which t is as large as 
we wish? That the answer to this question is yes follows from Theorem 1 and 
the next theorem whose proof will absorb the remainder of this paper. 
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Theorem 3 Let L be any number field and p a fixed rational prime. Then 
there exists a cyclic extension F of degree p over L which in turn possesses a 
cyclic unramified extension K of degree p over F satisfying: 


p’ divides #(H'(Gal(K/F), Ex)) 


where r is the rank of Εἰ, as a Z-module. 


2. Proof of Theorem 3 


We will assume throughout that L is neither Q nor any imaginary quadra- 
tic field since in these cases r = 0 and the theorem is trivial. To prove the 
theorem, we shall construct a cyclic extension K, of L of degree p for which 


Nx, (Ex, ) = Εἴ (1) 
and 


K,/L is unramified at the infinite primes 
of L and there exists a finite prime of 
L ramified in Καὶ 4,71. (2) 


Then we shall construct another cyclic extension K, of L of degree p such 
that 


If p is a finite prime of L ramified in 
K,/L, then p is unramified in K,/L (3) 


and 


The compositum K = K, K, is such that 
N xx, (Ex) Ξ- EX: (4) 


Before actually constructing K, and K,, let us show that their existence 
suffices to prove the theorem. 

First, K, and K, are linearly disjoint over L. For there exists a prime p of 
L ramified in K, /L and such a p is unramified in K/L. Thus Gal(K/L) ~ 
Z/pZ Ὁ Z/pZ and so there exists a subfield F of K containing L with F 
distinct from K, and K, and with [K : F] = [F : L] = p. Choose any such F. 
We shall show that K/F satisfies the conclusions of the theorem. 

It is easy to see that K/F is unramified, for suppose ὋἜ is a finite prime of 
F ramified in K/F. If p is the L-prime that Ἢ divides, then p ramifies in K , /L 
and in K,/L, which is impossible by the construction of K,. Moreover, the 
infinite primes of F are unramified in K/F since the infinite primes of L are 
unramified in Κι (1. 
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It remains to show that p’ divides #(H'(G, Ex)) where G = Gal(K/F). 
Let Wx denote the group of roots of unity of Κα. By (1) and (4), 
Nx (Ex) = E®’. It then follows that 


Ex/Wx ~ E, W/W ® (xj (Ex)) W/W 


where y,,,(Ex) is the group of units ἤ of K for which Νκι(η) = 1 and the 
direct sum is as Z[G]-modules. For if € € Ex, then Ν κι ι(ξ) =n?’ where 
ἡ € E, is unique modulo roots of unity and ξη ἦἷ € y,,,(Ex). Thus 


H°(G, Ex/Wx) = Η“(Ο, Ε, ΜΠ) ® H°(G, (nx (EK))Wx/Wx). 


We cannot say much about the second factor in this direct sum decomposi- 
tion, but since G acts trivially on E,; W/W, and E,; Wx/W, is a free Z- 
module of rank r, #(H°(G, E, Wx /Wx)) = p’ and thus we see that p’ divides 
#(H°(G, Ex/Wx)). Then by the Herbrand quotient, p’*! divides 
#(H'(G, Ex/W,)). Now from the exact sequence of G-modules, 


1 —— We -- Ex, — Ex/W — 1, 
we obtain an exact sequence of cohomology groups, 
—— Ηἴ(α Μὴ -----α H', Ex) 
=> HG, Ex/Wx) —— H°(G, Wx) ——. 


But #(H°(G, W,)) divides #(Wx/Wi) = p” where w = 1 if K contains the 
pth roots of 1 and w = 0 otherwise. Hence the order of Im(z*) = Ker(6) is 
divisible by p’*1~™ so that ρ' divides #(H‘*(G, Ex)). 


3. Construction of K, and K, 


To complete the proof, we must construct K, and K,. We shall con- 
struct K, first. 

Let 1, 81. €,..., €, denote a system of coset representatives of E, (ΕἾ. Let 
L, = L(C,) where ζ, is a primitive pth root of unity. Let us first observe that 
an element « € L belongs to (L*)? if and only if « € (L;)?. This is obvious 
since [L, : 1] is prime to p. For each i, let M; = L,(¢}/?) where εἰ! denotes a 
root of x? — ε; in C. By our observation above and our choice of €;, we see 
that [M; : L,] = p. Since M;,/L, is cyclic, the density of primes of L, that split 
completely in Μ, is 1/p < 1. Thus there exist infinitely many L,-primes that 
do not split completely in M;. Now choose distinct L,-primes Ἔ; that are to 
satisfy: 


1. ; does not split completely in M;; and 
2. ‘B; does not divide 20,.. 
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Let p; denote the L-prime that Ὃν; divides. 

Suppose ¢; = BP for some β, ε Lig, then L, φίε }"}) = Ly, φ, so that ὃ 
splits completely in M,/L, contrary to the choice of ‘B;. We conclude that 
δι € (Ly g,)?. All the more so, ¢; ¢ (L,,)’. Moreover, p; does not divide 20,. 

Now 1, /(L,,)? is a finite elementary abelian p-group. Since e; ¢ (L,,)?, we 
can. choose a subgroup H; of L,, of index p not containing ¢;. By local class 
field theory, there exists a cyclic extension F; of degree p over L,, with 
Nyt, (Fi ) = H;. Note that ¢; ε U,,, the unit group of L,,,so that U,, £ H; 
and hence F,/L,,1s ramified. Let y,,: L5, ~ T be a continuous local character 
with Ker(y,,) = H; where here T denotes the set of complex numbers of 
absolute value 1. If δὼ denotes the set of real infinite primes of L, then for 
each p,, Ε S,, let χρ,: R > T be the trivial character. Now set S = {p;: 1 < 
i<s} ὦ δὼ and apply a variant of Grumwald’s existence theorem [1, p. 103, 
Theorem 5]. Since no p; divides 20,,S9 FS where Sy = {p|p divides 20,}. 
Thus there exists a continuous global character y: C,, > T of order p (the x,, 
have order p and the x, have order 1) whose restriction to L, is x, for all 
ΡῈ S (here C, denotes the idéle class group of L and we view L, embedded 
in C, in the usual manner). Let M = Ker(x). Since x is continuous and T 
Hausdorff, Ker(z) is closed. Moreover, [C; : M] = p since y has order p and 
Im(x) S T is cyclic. Thus M is an open subgroup of C, of finite index p. By 
class field theory, there exists a cyclic extension K , of L of degree p such that 
Nxyt(Cx,) = M. 

We shall now show that K, has the desired properties. If P; is a prime of 
K, dividing p; Dx,, then 


Nx, ply; (Kt, pi) = M 7 Lj, = Ker(y) ἡ L;. 
= Ker(x|7.) = Ker(y,,) = Hi, 


and so 8; * Nx, »/t,,(Kx p,). Thus δὶ Νκ, (Κι) for 1 <i <s and (1) fol- 
lows. Also, if p,¢S,,, then N Ky, go/Lpo(K1,9,,) = Ly, so that p,, is 
unramified in K, /L. In addition, by our original assumption on L, 5 > 1 and 
SO, as we observed during the construction, some finite prime of L ramifies in 
Κι, Thus (2) is also satisfied. 

The construction of K, is similar to that of K, and we will leave out 
some details at the points of similarity. 

Let 1, 11, 2, .--,, denote a system of coset representatives of Ex, /E,. 
For each i, let Ky; = K,(n}/?) where ἡ" denotes a fixed root of x? — ἡ; in C. 
Choose distinct L-primes q; which are to satisfy: 


1. q; splits completely in Κι /L; 
2. q; does not divide 20,; and 
3. if 2; is a K,-prime over q;, then ἡ, ¢ (Kf 9,)’. 
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To see that such q; exist, we first notice that there exist infinitely many 
L-primes that split completely in K,(¢,)/L but do not split completely in 
K, (ζ 1. Then by an argument similar to the one used in the construction 
of K,, we see that properties 1 and 3 are satisfied for such a q;. See Fig. 1. 


Ky ,(¢,) 


Fic. 1 


Since αἱ splits completely in K,/L, K,,9,=L,, and by 3, n; ¢ (L,,). 
Again, there exists a cyclic local extension M;/L,, of prime degree p for which 


ni ἐ Nite (M; ). Set 


A = {L-primes p: p ramifies in K,/L}, 
So = {L-primes p: p divides 20,}, 
S={q,:1<i<t!}USo VA. 


As before, define continuous local characters x, 1 —- T with Ker(y,,) = 
Nujt, (M7 ). In addition, for each p € A ὦ So, let y,: Ly —> T be the trivial 
character. Since [[,-5, Xp = 1, there exists a continuous global character 
x: Ο,--» T of order p for which y|,, = x, for all p ε S (again see Artin and 
Tate [1, p. 103, Theorem 5]). Let K, be the cyclic extension of L of degree p 
determined by y. It remains to show that K, satisfies the desired properties. 

Let Q; be a prime of K, over q;, let δ: be a prime of K = K, K, over Q;, 
and let p; be the prime of K, that ; divides. By construction of K,, 
ni € Nx;, pit, (K2, pv). But q; splits completely in Καὶ, /L and so p; splits com- 
pletely in K/K,. Hence L, = K;,9, and K2,,, = Kg, and we see that 
ni € Nxg/x, (Κα) for 1 <i <t. It follows that Nx)x,(Ex) = Ek, and so (4) 
is satisfied. Let pe AUS,. If Ἔ is a K,-prime above p, then 
Nx, g/t,(K2, φ) = Ker(y) ο Ly = Ker(y|r,) = Ly so that L, = Kz,» and p 
splits completely in K/L. In particular, p is unramified in K, (1. Thus (3) is 
also satisfied. This completes the proof of Theorem 3. J 
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1. Introduction 


Suppose we are given N points in U“, where k > 1 and U* is the unit 
cube, consisting of points y = (y,,..., y,) with 0 < γ᾽ < 1 (i= 1,...,k). Forx 
in U*, write Z(x) for the number of points among the N given points that lie 
in the box 0 < y,; < x; (i= 1,..., k), and put 


D(x) = Z(x) — Nx, °° x. 


The irregularity of the distribution of the given N points may be measured in 
various ways by the behavior of the function D(x). For example, one may 
consider the J?-norms 


I/p 
ID p= (floor ax)” 
(Here and throughout, except where noted, the integrals are over U*.) Roth 
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[4] proved that for some c,(k) > 0, 
| D 2 > ea(k)(log Ν) 1", (1.1) 


and Davenport [2] showed that for k = 2 this is best possible, except for the 
value of the constant c,(2). 


Theorem 1 For p> 1 there is a constant c,(k, p) > 0 such that 
| Dll, > cxlk, pMlog Nyt 92. (1.2) 


Since || D||, cannot decrease with p, this estimate for 1 <p <2 is a 
sharpening of (1.1). Moreover, it is best possible for k = 2 and 1 < p <2. 
Perhaps it is best possible, except for the value of c,(k, p), for every k > 1, 
p> 1. We cannot give an estimate of the same strength for || D||,. But with 

E(x) = D(x)(log Ν) δ᾽, 
we have 


Theorem 2 For g > k — 3, there is a 61 = c,(k, q) > Ὁ such that 
| [6 E(x) | |log|c2 E(x)| |? dx > 1. (1.3) 


This contains Theorem 1: For pick c,; =c3(q)>0 so small that 
z|log z|* < 4 for 0 < z « ο3. Then the integral of |c, E(x)| |log|cz E(x)| |? 
over points x with |c, E(x)| Σ 63 is > 4. But in this domain |c, E(x) |? > 
C4 [62 E(x)| |log|c, E(x)| Κ, where c, > 0 is the minimum of z?~* |log z|~4 
for z with z >c,. Thus 


| |c, E(x) |? dx > 3c, , 


and (1.2) follows. 
Sobol [6] noted that 


1 ¢s(k) 
[Dl > Z- Ν᾿ 


and in [3, 6] there is the statement that one can always give examples where 
||D||, <4. But Sobol gives such an example only for N = 1! In fact, we have 


Theorem 3 For some ἐς = ¢¢(k) > 0 and for large N (say N > 659), 


| Dl, > ες log log N/log log log N. 


The function on the right-hand side is the same as the one given by 
Aardenne-Ehrenfest [1] in her estimate of || δ. ie, the supremum of 
| D(x)|. As far as I can see, this is a coincidence. It is possible that our 
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estimate of || D||/, could be somewhat improved at the cost of a further 
complication of the proof. 

For the sake of completeness we mention that at present the best esti- 
mates for || D||,, are [δὼ > c1(k)(log N)“~*”? as a consequence of (1.1), 
and || D||,, > c, log N for k = 2, according to [5]. 

The following question is open: Is there a function f(N), decreasing to 0, 
such that for any C > 0 the measure of the set of x with |D(x)| <C, is 


O(f(N))? 


2. The Method 


Roth obtained his estimate by constructing an auxiliary function F(x) 
with 


| F(x)D(x) dx > cg(k)(log Ν}} (2.1) 
(where cg(k) as well as all of our constants are positive) and 
| F?(x) dx < co(k)(log N)F7?. (2.2) 


These two inequalities, together with Schwarz’ inequality, yield (1.1). It may 
be shown that the very. same function F(x) also has 


| F2"(x) dx < co(k, m)(log N)"*-) (m= 1, 2,...). (2.3) 
This is rather surprising since F(x) was constructed with only (2.1) and (2.2) 
in mind. Now by (2.3), 

| F lam < Cro(k, m)(log N)“~ 2 (m= 1, 2, ....), 
and since || F ||, cannot decrease as a function of r, we have 
| F Il. <cyi(k, r)log ΝΛ Σ᾽ (r > 0). (2.4) 
Holder’s inequality yields 
| F(x)D(x) dx < [6,6]: 
provided p > 1, r > 1 with (1/p) + (1/r) = 1. This inequality, together with 
(2.1), (2.4), gives (1.2). 
We shall use a slightly modified function F(x) with (2.1) and with 


| F?™(x) dx < (2m)"*-9(y 4 1)"™@-9 (m= 1,2,...), (25) 
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where ν 15 the integer determined by 


2N <2” <AN. (2.6) 


This explicit estimate will be needed for Theorem 2. (Roth’s function F(x) 
would give (2.5) with the exponent m(2k — 3) replaced by m(2k — 2), which 
would yield (1.3) under the slightly more restrictive condition that 
q>k-—1,) 

In order to prove the more difficult Theorem 3, we shall need certain 
finite sums, rather than integrals over U*. For Theorems 1 and 2, we could 
also have used finite sums, which perhaps would have made the exposition 
more uniform. But the notation with integrals is slightly simpler, and by 
using integrals we stay closer to the established style, as presented by 
Kuipers and Niederreiter [3] or Roth [4]. 


3. Generalized Rademacher Functions 


Every x in 0 < x < 1 may uniquely be written as 


= Y Bye)2? 


where the digits B(x) are 0 or 1, and where the sequence of digits does not 
end with 1, 1, .... The functions 


R(x) = (1) (r= 0,1...) 


are the Rademacher functions. (This notation is more convenient for us than 
the more common R,(x) = (— 1}, 
An r-interval will be an interval 


m2" <x < (m+ 1)27’, 


where m is an integer in 0 < μι < 2’. An r-function, where r is a nonnegative 
integer, is defined as a function f(x) in 0 <x <1 such that in every r- 
interval, either f(x) = R,(x) or f(x) = —R,(x). There are 2’ r-intervals, hence 
22" r-functions. An r-function has 


1 


| f(x) dx =0. 


Lemma 1 Suppose ἢ. ...,f, are 11, .--, "functions, respectively. Suppose 
an odd number among r;, ..., Γι is equal to their maximum r = max(rj, ..., Κι). 
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Then the product f,(x) --+ f,(x) is an r-function, and 


[-πο)--- πο) de =0 


Proof It suffices to observe that the product of an odd number of 
r-functions is an r-function, and the product of an r-function and an s- 
function with s < r 15 an r-function. 

Given a k-tuple r = (r,, ..., r,) of nonnegative integers, put 


rf Sry to Ἔ Τα: (3.1) 
For x in U*, write 
R,(x) = R,,(1) ve R,,(Xx). 


An r-box is defined as a box B= 1, x -:: x I,, where J, is an ΓΚ interval for 
j=1,..., k. An r-function is a function f(x) on U* such that in every r-box 
either f(x) = R,(x) or f(x) = —R,(x). There are 22" such r-functions. For 
given Xj, ...,Xj—1,Xj4+15---, X,, an r-function is an r,-function in the variable, 
x;. Combining this remark with Lemma 1, we obtain 


Lemma 2 Suppose f,, ..., f; are, respectively, r,, ..., r-functions. Write 
Γι = (Wi1,--+> Ye) and r = max(r,,,..-, 1). Suppose ancodd number among r,, 
.» Ty, equals r. Then 


{ fs) --- Λα) dx = 0 


In the lemma the kth coordinate is singled out. The same result is of 
course true for the jth coordinate with 1 <j <k. 


Lemma 3 Suppose k = 2, v > 0. Suppose for eachr with |r| = v we have 
picked an r-function f,. For ¥,, ..., To», with [τῷ ΞΞ τ τ᾿ = |r2_| = v, put 


S(t ay +5 Fam) = | fer) Sram) ἀκ. (3.2) 


Then 3(r,, .--5 F2m) = 9, unless there is a decomposition of {1, ..., 2m} into m 
pairs (i1, ji), --.» (ims jm) With r,,=1;,fort=1,...,m 

Proof For m=1 we know from Lemma 2 that S(r,, r,) =0 unless 
Γι ΞΞ 22» Le. unless r; =r, in view of |r, |= |r,|. For m> 1, we know 
again that the integral is Ὁ unless r,, = r;, for some i, + j,. Since the square 
of an r-function is 1, it follows that in this case 


3(r,, eeey lon) = 3(r,, assy ri “469 ri. ar lom)> 


where the dots mean that r 
completed by induction. 


r;,1,, have been omitted. The proof is now 
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4. Estimation of Moments 


Lemma 4 Suppose k > 2,m> 1 and v > 0. Suppose that for every r with 
|r| = v we are given an r-function f,. Put 


F(x) = Σ f(x). (4.1) 


[rl=v 


Then (2.5) holds. 


Proof Let M = M,,,(v) be the maximum of { F?"(x) dx, taken over all 
functions F(x) of the type (4.1), where each f,(x) is either an r-function or is 
identically zero. We shall prove that 


Μ,,(ν) « (21) 42 ΚΠ Dy + 1M". (4.2) 


Let the f, be chosen such that M is attained for F(x) given by (4.1). We then 
have 


M = | F(x™dx= Yoo Σ Sty. tomb (4.3) 
lrif=v = Jram|=v 
with S(r,, ..., Fam) given by (3.2). 

Suppose at first that k = 2. Then by Lemma 3 we may restrict ourselves 
to summands having r,, = r,, (t = 1, ..., m) for a certain division of {1, ..., 
2m} into pairs (i,, j;), -.-» (im>Jm)- If this is the case, the integral 3(r,,..., Pom) 
is Ὁ if some of the functions f,,, ..., f,,, are identically zero, and it is 1 
otherwise. The number of divisions into pairs is (2m — 1)(2m — 3)...3..1 « 
(2m)", and the number of possibilities for r;,, ..., ¥;,, is (v + 1)". Thus for 
k = 2 we obtain the desired estimate 


| F(x)?" dx < (2m)"(v + 1)". 


Now let k > 2. For each summand on the right-hand side of (4.3) put 
r= max(ri,, ---5 Tam.) and write h for the number of components r),,..-, 
Yom, Which are equal tor. Then 3(rj,..., lam) = O unless h is even, i.e., unless 
h = 2u with 1 <u <m. There are (3) ways to pick 2u elements out of {1,..., 
2m}, and 3(r,, ..., Γι) is invariant under permutations of ry, ...,T2_- Thus if 
the numbers u, v are always connected by u + v = m, then 


cae (21 ὁ 
M= py (3 »y dn Σ 2. 3(r1, soso Toys 51» +--+ 520)» 
u=1 μ 0 


r= 


where 5", is the sum over 2u-tuples rj, ..., 2, with |r, J=---= [Ὑχω[ Ξε ιν 
and ri, Ξ ΞΕ ὕχωκ = 7, and δ᾽. is the sum over 2v-tuples 5.» ..., 52υ with 
\s,|= τὴ --Ξ [520 =v and Siks cers S20, k <r. Thus if 


A,(x) = 2.” Sel), 
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where the sum is over r with |r| = v and r, =r, and if 
B(x) -- Σ." f(x), 


where the sum is over r with |r| = v and r, <r, then 


M=Y a Σ | A2(x)B?"(x) dx. (4.4) 
u=1 2u γξο 
By Hodlder’s inequality, 


/m 


| A2"(x)B?*(x) dx < | | A2™(x) ax) "| | B2"(x) ix) (45) 


The sum defining A,(x) is over r = (f,r) where F=(r,, ..., ™%-1) runs 
through (k — 1)-tuples with |F| = v—r. Hence for given x,, the function 
A,(x) in x1, ..., X,—-1 is of the type of F(x), but with k — 1 in place of k and 
ν — r in place of v. Thus if we proceed by induction on k, we may conclude 
that , 


| A2™(x) dx < (21) 2 ὅν - [γπάτ2 (4.6) 


The function B,(x) is like F(x), except that certain f, in the sum (4.1) are 
replaced by 0. Thus by our definition of M = M,,,,(v), 


| B?™(x) dx < M. (4.7) 
Combining (4.4)-(4.7) we obtain 
M <(v+1) y 2") ((2m)?*~ 5(v + 1, ?)"Mel™ (4.8) 


Dividing by M(v + 1) and putting M = 1" we get 


(ν- 1} « Σ (7) ((2m)?*-5(v + 1} Γ᾿)". 


So with K = (2m)**~°>(v + 1)§"7L' we have 


ν 4 ἡ-" « Σ [1)κ' 2 Σ ἄμ 5 (am? Ky 


Since 
π΄ ἢ 
-1 1)..." 
(v+ 1) ὩΣ Quy” Ὁ )", 
we obtain 4m?K > (v 1) ἷ, or 
L < (2m)**-3(v τ 1", 
or (4.2). 
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5. Construction of an Auxiliary Function 


Lemma 5 Suppose |r| = v with 
2” >2N. (5.1) 


Then there is an r-function f. with 
| f,(x)D(x) dx > N +2777 2-2, (5.2) 


Proof The integral in (5.2) may be decomposed into integrals over 
r-boxes. We choose f,(x) such that the integral over every r-box is 
nonnegative. 

If B is the r-box given by m;27"' < x; < (νι, + cae — a) oe 
be the smaller box m;2~"' < x; < (νι; + 4)2~" (i= 1,..., k). Then 


| R e(x)D(x) dx = [ ee ie [jit te 


Bio εἰ ΞῪὸῪ &=0 


., k), let B’ 


x Diy, + 6,277 77, oy +e 27 ἢ ἄγ. (5.3) 


Now 


1 1 
ue 2S 1)“ ΤειΖίγ, +627 Tye te 2. ἘΠῚ (5.44) 
ξι"Ξ &= 


is the number among the given N points that lie in the box 
γι Ξ Xj <y, +27" 1 (i= 1,..., k), which is contained in B. Thus this sum is 
0 if B contains none of the given N points. On the other hand, 


1 1 
δ ὑπῆν: Σ (-- 1).1} : ΜΕΝ Ἔ ΕΠ ἢ οὖν (y; + & 2-1) 


é1:=O e&=0 
S(S1) 24 =] {ς-- 12: © (5.5) 


If B contains none of the given N points, then 
| RA R,(x x) dx ee (— 1 ΝΥ: 


since B’ has volume 2~”~“. We observe that in view of (5.1), at least half of 
the 2” r-boxes B contain none of the given N points. Thus we obtain (5.2) for 
a suitably chosen r-function f,. 

Given v with (5.1), we construct an r-function f, according to Lemma 5 
for every r with |r| = v, and we define F(x) by (4.1). Then since the number 


Τ As noted in Section 2, our auxiliary function is almost the same as the one of Roth [4]. 
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of k-tuples r with |r| =v is > c,,(k)(v + 1)’, we obtain 


| F(x)D(x) dx > c13(k)(v + 1 12ὉΝ. (5.6) 


6. Proof of Theorem 1 


Let v be the number with (2.6). Then (5.6) yields (2.1), and (2.5) is true by 
Lemma 4. Theorem 1 follows. 


7. Proof of Theorem 2 
Write exp x = οὐ and 
Eo(x) = D(x)(v + ἡ), G(x) = F(x)v + ἡ) τ, (7.1) 
where F(x) is the auxiliary function constructed in Section 5. 
Lemma 6 For gq >k — 3, 


[ 1GGx)| exp(| G(x) |") dx < crak, 4). 


Proof Since || 6, is nondecreasing in «, we have [6]. < || G||2,, for 
a < 2m, whence by (2.5), 


a/2m 
| |G(x) ft dx < (| G(x)2™ ix} < (μι) - 3/2), 
Therefore 


J 16¢x)| exp(|G¢x)[") dx = Σ τι {06} ax 


ἜΝ 
= (1 + (h/g))(k — (3/2) 
«Σ μιά πᾷ 4)) , 


(7.2) 


where <f> denotes the next larger even integer to β. For large h, we have 
<1 + (h/q)> < 2h, and an exponent 


(1 + (h/q))(k — 3) < (1 — 26)h 


with ὃ = (1 — ((k — 3)/q)/4. Since h! > (2h)"~™ for large ἢ, a typical sum- 
mand is < (2h)~™, so that the sum in (7.2) is convergent, say to c,,4(k, 4). 
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With v determined by (2.6), we have 
| G(x)Eo(x) dx > c,5(k) > 0 (7.3) 


by (5.6), (7.1). Pick εἰς = Ci¢(k, q) so large that c,,e “"6 < 4c,,. Let S be the 
set of x in U* with 


| Eo(x)| < exp(| G(x)" — 6,6) 


By Lemma 6 the integral of G(x)E)(x) over S is < 4c,,. Hence the integral 
over the complement T of S is > $c,,, by (7.3). On T, 


(log | Eo{x)| + ¢y6)*> | G(x)|, 
so that 


J |Eo(x)| |log|Eo(x)| + οι! dx > | | Bo(x)G(x)| dx > ders. 


Choose c,;>0 so small that z|log z + cy¢|4< 4c,5 ἴοτ 0 « 2 «ο,7. 
Then 


i | Eo(x)| |log | Eo(x)| + cig |4 dx > 4c,5, 
where V consists of xe U* with |E 9(x)| >c,,. We now choose 
C2 = C2(k, 4) so large that 

ο; = οχί(ν + 1)/log N)*~ ¥/? 
has 
c,z|log c,z|? > 4cyz|log z + cy, |" 

for z > c,7. With this value of c,, 

ὁ E(x) = c, D(x)(log Ν) “ΠΥ = c Eo(x) 
satisfies (1.3). 


8. Estimation of Sums 


We now turn to Theorem 3. For simplicity of notation, we shall assume 
that k = 2. The constants y,, γ2. ... will be positive and absolute. 

Suppose we are given a discrete set of points in the quadrant y, > 0, 
y2 5. 0. For x = (x,, x.) in this quadrant, write Z(x) for the number of 
points among the given discrete set that lie in the rectangle 0 < y, < xj, 
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0 < y, < x,. Put D(x) = Z(x) — Nx, x,. Now if Q is a square 
m; <x; <m;+ 1 (i = 1, 2) (8.1) 
in our quadrant, the results proved above about [72 remain true for Q. More 


precisely, if the number N(Q) of points among the given discrete set in Q 
satisfies 


2" > 2N(Q), (8.2) 
then there is a function F(x) defined on Q such that 


| F(x)D(x) dx > y,(v + 1)27°N (8.3) 
Q 
(in analogy with (5.6)) and 
| F(x) dx <(2m)"(v+1)"  (m=1,2,...) (8.4) 


(in analogy with (2.5)). For example, in order to derive (8.3), we note that the 
alternating sums (5.4) depend only on the points of the given discrete set that 
lie in Q. 

Given a square Q determined by (8.1) and given a nonnegative integer μ, 
a grid T(Q, μ) will be a set of points x = (x,, x2) of the type 


x,=m,+6,4+1,27% (i = 1, 2), 
where €,, 5 are fixed with 0 < €, «2. " (i= 1, 2), and 1,, 1, run indepen- 


dently through the integers 0, 1,..., 2“ — 1. The grid I'(Q, μ) is contained in 
Q and the number of its points is 


TQ, μ)} = 4". (8.5) 

Because of the freedom in choosing €,, €,, there are infinitely many grids 
Γ(Ο, μ)Ρ contained in Q. 

Given integers p > v > 0, we may replace the integrals (8.3), (8.4) by 


sums over the points of a grid I'(Q, μ). This is so because in the alternating 
sums (5.3)}-(5.5) we never get outside the grid. We sum up our results in 


Lemma 7 Let Q be a square of the type (8.1). Let μ, v be integers with 

u>v>0 and = 2°>2N(Q). (8.6) 

Let a grid T(Q, μ) be given. Then there is a function F(x) defined on T(Q, μ) 
with 

TQ μ)}] Σ F(R)D(x) > viv + Π2"Ν, (8.7) 


χε T(Q, p) 


IT(Q, w)|-? 2, ae) <(2my"(v+1)" (m=1,2,...). (8.8) 
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We note that | F(x)| <v+ 1 since F(x) was constructed as the sum of 
v + 1 r-functions. 
More generally, we shall allow squares Q of the type 


m,27-*<x;<(m,+1)27* ΞΟ (ἐπ 1, 2), , (8.9) 
where / is a nonnegative integer. A grid I'(Q, μὴ will be a set of points 
X = (x,, x2) of the form 

XxX; = 27 4(m; + κ᾿ + I, 2; ") (i = 1, 2) 


where €,, €, are given with 0 < €, < 2-4 and where J,, /, run through 0, 1, 
..., 2" — 1. Such a grid is contained in Q and its number of elements 15 given 
by (8.5). 


Lemma 8 Let Q be a square of the type (8.9). Let μ, v be integers satisfy- 
ing (8.6). Let a grid T(Q, μ) be given. Then there is a function F(x) defined on 
ΓίΟ, μ) with (8.8), with 


TQ, μ[ Σ᾽  F(x)D(x) > γιίν + 12. 74N, (8.10) 


xeT(Q, μ) 
and with 
|F(x)| <v +1. (8.11) 


Proof The dilation T with Tx = 27x maps Q into a square Q’ of side 1 
and it maps ΓίΟ, μ) into a grid ['(Q’, μ). If P is the given set of points, put 
P’ = T(P) and define Z’(x) with reference to P’. Put N’ = 4~*N and D’(x) = 
Z'(x) — N’x, x2. Then 

D'(Tx) = Z'(Tx) — N'44x,x2 = Z(x) — Nx,xz = D(x). 
We apply Lemma 7 with I'(Q’, 2) in place of [(Q, y), and with N’, D’ in place 
of N, ἢ. We obtain a function F’(x) defined on I'(Q’, μ). The function 
F(x) = F’'(Tx) = F’(2*x) is defined on Γίο, μ) and has the desired 
properties. | 


Lemma9_ Let A, u, v, Q, T(Q, μ), F(x) be as in Lemma 8. Suppose X > 4. 
The number φ(ΧῚ of points x of T(Q, μὴ with 


| F(x)| > X(v + 1)” 
satisfies 
φ(Χ) < |T(@Q, »)| exp(—2-*x?) 
Proof We have 
φ(Χ) ΓίΟ, wu) |X?" < (2m)"— (m= 1, 2,...) 
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by (8.8). Pick an integer m with 
1 < (1/4e)X? « κι < (1/2e)X?. 
Then 
p(X)|T(Q, u)|~* < (2mX~ 2)" « ε΄" < exp(—27*X?). 


9. Preparations for Theorem 3 


The basic idea of the proof 15 as follows. For most squares Q of the type 
(8.9), N(Q) should be not much larger than Ν4 ὁ, where Ν is the number of 
our given points in U*. For such a square Q, we may choose v with 
2” > 2N(Q) such that 2” has the order of magnitude N4~ 4, and hence 2”* 74 
is about equal to N. Thus (8.10) becomes 

IPQ, μ)} Ὁ 2 - (x)D(x) > γχίν + 1), (9.1) 
xe ou 
for most squares Q. The auxiliary function F(x) has | F(x)| <v+ 1, but 
from Lemma 9 it is clear that it is usually not much larger than (v + 1)*/?. If 
we knew that the summands in (9.1) where | F(x)| is small compared to 
v + 1 give a large contribution, we could conclude that 


TQ" d [Dx 
xe F(Q, μ) 


is large. If we knew this to happen for many grids, we would ultimately be 
led to a good lower bound for || D||,. It will therefore be necessary to pay 
attention to the case when the relatively few summands of (9.1) where 
| F(x)| is almost as large as v + 1 give a large contribution. In this case there 
is a small subset A of I'(Q, μ) such that 


IT(Q, uw) |" Σ |D()| > 7s - (9.2) 


xeA 


In order to go beyond such a constant to a function that tends to infinity 
with N, we shall have to apply Lemma 8 with different values of A, μ, and v. 
We now proceed to carry out this program. Let p be the integer with 


4N < 4 < I6N. (9.3) 
Putting 
v(u) = (12)"(u!)? ss (u = 1, 2, ....), (9.4) 
we have 
μ4) 4 DF? exp(—27 2u- 2v(u)) < 4u exp(2v(u — 1) — 3v(u—1))< 1 (9.5) 
for u > 1. Let t be an integer with 
v(t) < 2p, (9.6) 
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and write 
A(u)=p— Fru), plu)=vu)t+1 (u=1,..., ἢ. (9.7) 
Then A(u) + μ(μ) = p + 3v(u) + 1 is increasing in 1 <u <t, and 
2 ΟΝ = ΔΝ δ (9.8) 
A square Q will be written as 0" if it is given by inequalities 
m,2-*) <x, <(m,+1)2-*™ (i = 1,2) 
with integers m; in 0 < μι, < 2*. For 1 < v < u < t, the number of squares 
9" contained in a given square Q” is 
A(v, u) = 4: τ 200, 
For v < u and a given square 0", write A(v, 05) for the number of squares 
Q’ « Q with 
N(Q’) < 2N4°*™. (9.9) 
Clearly, 
0 < A(v, 05") < Α(υ, u). (9.10) 


The right-hand side of (9.9) is 2N times the area of 0", while the average of 
N(Q’) over all squares Q” is N times the area of Q”. It follows that (9.9) must 
be true for at least half of the squares Q”, i.e., for at least $4“ squares 0". So 
for v <u, 


Σ, Α(υ, Q*) > 34, (9.11) 
Qu 


where the sum is over all squares Q”. 
For 1 <u <t and every square 0", put 


h(Q") = yal A(v, ΟΑ(, u), (9.12) 

so that by (9.10), 
0<h(Q") <u. (9.13) 
We shall write I’ for a grid I'(Q", u(u)). The number of points of I is 
Bu) = [Γ"] = 4. (9.14) 


Put 
Biv, u) = Β(μ) Β(υ)" ἦ = 448-40), 


For 1<v<u<t, the square Q is the disjoint union of A(v, u) squares Q”. 
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The grid Γ" = T'(Q*, u(u)) has “mesh” 2. *~#™ (.e., its points have horizon- 
tal and vertical distances of this size), and hence I™ is the union of A(v, u) 
grids of the type T'(Q°, μ(μ) + A(u) — A(v)), which also have this mesh. In 
turn, since μ(μ) + A(u) — A(v) > μίυ), each grid T(Q’, μ(μ) + A(u) — A(v)) is 
the disjoint union of 24 *#~4@)—#®) grids Γ(Ο", u(v)). We write all this up 
in 

m= (Ur (9.15) 

Qucqv 


Here I = Γ(ρ", μ(ω)), the outside union is over the A(v, u) squares 0" < Q”, 
and the inside union |)’ is over 


JD A(u) + μι) — Av) — wv) — Biv, u)A(v, u)~ 1 


grids Γ" = T'(Q”, y(v)). 
Given a grid I, write 


9(Γ5) = Bu)" Σ, [D&)I. (9.16) 


xe lu 


10. A Lemma 
Lemma 10 For 1<.u<t and a grid T’* =T(Q", μ(μ)), 


σί(Γ") = y4h(Q"). (10.1) 
Proof We take 
Ya=12 ° (10.2) 


where γ, is the constant of Lemma 8. 
We begin with u = 1, in which case 


h(Q") = A(1, Q”), 


which is 0 or 1. The interesting case is when it is 1; then N(Q') < 2Ν4 10) 
or by (9.3), (9.7), 


2N(Q') < Ν4" 40) = N4t~ 02) < 2.6) 


Thus (8.6) is true for μ = μ(1), ν = v(1), F(Q’, u(1)), and we may apply 
Lemma 8. Since | F(x)| < v + 1, (8.10) yields 


σ(Γ") τ ἢ δι [τ by ‘1D, > Y1 27 aN > γι 21 > 74 
χεΓ 


by (9.8). 
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We now proceed to the induction from u — 1 to u, where 2 < u < t. By 
(9.15) with v =u -- 1, 


9(Γ") -- Β(ίω -- 1, ΠΣ Dd’ φ(Γ"}), (10.3) 
Qu-1cQu [u-t 
where )” is a sum over B(u — 1, eos 1, u)~* grids P71 c Qe}, 
On the other hand, for 1<uvu<u—1I1, we have A(v,u)= 
A(v, u — 1)A(u — 1, u) and 


A(v, Q*VA(v, ι) = ΟΣ Αρ, OVA, ι) 


Qu-1cQu 
-- Α(μ-- 1, με) Σ᾽ A(v, Q*~*)/A(v, u -- 1). 
Qu-1teQu 
It follows that 
NO") = Awe 05) Aw Luy* ΣΟ MOM (04) 


u-1 <Qu 
It will be convenient to write this as 


NO") = Aw, 05). Bu Ly) EVM, (105) 
where the inner sum δ΄" is over B(u — 1, u)A(u — 1, u)~* grids “7! < 05 1, 
as in (10.3), and the summand h(Q“~') is independent of Γἧ ?. 
Our inductive assumption yields σίΓ" 1) > y,h(Q"~'). Comparing 
(10.3) and (10.5), we thus obtain the desired relation (10.1), provided 
A(u, 0") = 0. It remains to deal with the case when 


A(u, 0") = 1. (10.6) 


Now the double sum in (10.3) and (10.5) is altogether over a set ὦ of 
B(u — 1, u) grids T“~'. To deal with the case (10.6), it would suffice to 
construct a subset T of S such that (with the notation h(I“~ 1) = h(Q*"~*) if 


mt =T(Q*"*, wu — 1))), 
Β(μ -- 1, )Γ} DY (gE *) -- γε Ἀ(Γ’ 1})} γε. 


Feu-let 
Since μ(0 " ἢ) < u — 1 by (9.13), it would suffice to have 
Biu—1,uyt Σ φ(Γ" ") γ4 Blu — 1, ἡ) ἰμ| 2} + γε, 


Te-let 


where |Z| is the number of elements of T; or 


~ Dd |P&)| 27. Bu — 1, u)-tul Z| + γε. 


Ie-tetxelfu-i 


Prime Ideal Decomposition in F('’”), ΠῚ 


WILLIAM YSLAS VELEZ 


UNIVERSITY OF ARIZONA AND SANDIA LABORATORIES 
TUCSON, ARIZONA ALBUQUERQUE, NEW MEXICO 


1. Introduction 


Let F be an algebraic number field, » an integer in F, x” — pu irreducible 
over F, and # a prime ideal in the ring of algebraic integers in F with 
(m, 99) = 1. We are interested in discovering how the ideal 99 decomposes 
when considered as an ideal in F(y!/”). It was shown, in Mann and Vélez [2, 
p. 132], that we can reduce this problem to the situation where (u, 7) = 1. 
Under this assumption we have that Y does not ramify in F(u*”). In Mann 
and Vélez [2] we also defined , Ψ,, (Φ), the counting function for the decom- 
position of F in F(y'"), where Ψ,, (99) merely counts how many ideals 
there are of each degree in the decomposition of F in F(u*”"). That is, if 


PnP) = Yai 


then # factors in F(u'/") into g; ideals of degree f,, for i= 1, ..., n, and 


m= δίς ι gi fi- 


+ This work was supported in part by the U.S. Energy Research and Development Admin- 
istration (ERDA), Contract No. AT(29-1)-789. The author is currently at Sandia Laboratories. 
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Let Q* denote the nonzero rational numbers and Z the ring of integers 
and consider Z(Q*), the group ring of Q* with coefficients in Z. Let us 
denote elements in Z(Q*) by δὲ n{r,], where n; € Z, r, € ΟἿ. Then Z(Z*) is 
imbedded in a natural way in Z(Q*). Moreover, - Ψ,, (9) = δ᾽ σί f;] can be 
considered as being an element in Z(Z*) since g; ε Z and f; ε Z*. Hence, we 
can define an addition and multiplication for counting functions by con- 
sidering the counting functions as elements of Z(Z*). 

In Mann and Vélez [2] we were able to explicitly compute ,;‘Y;.(P), 
where ! is a prime, (Ju, 9) = 1. Furthermore, we showed that if m = | [k_, Εἰ 
and [{Ν(Φ) -- 1, where N(P) denotes the norm of Y over 0, then 
ΕΨ, (Φ) = ΠΕ, Ψ (9) 

In this paper we shall give a recursive procedure for computing ; Ψ,, (9) 
in terms of the counting functions , ‘Y,(P), where / is a prime. Furthermore, 
we shall investigate the phenomenon where 


δ 5 


ΕἍ Ψ, (2) = [[ ePie(F), m= [le 


i=1 i=1 


2. Theorems and Proofs 


By o(u) modulo Y we shall mean the multiplicative order of μ in the 
group of units modulo &. 


Lemma 1 Let x” --τμ be irreducible over F, | a prime, (1, m) = 1. If 
I" || o(u) modulo 99 (where I" || b means I" |b, I"*! } b), then I" || o(u*!) modulo 
P,, where FP; is any prime divisor of P in F(p*""). 

Proof The proof follows by applying Theorem 2 in Mann and Vélez [2] 
and induction. ἢ 


Let | N(P) — 1. Then from Rules 1-4 in Mann and Vélez [2], we see that 
r'V¥,(P) depends only on three parameters, namely, I, /*, where 
|| N(A) — 1, and I" = (ο(μ), I). Set N(P) = p’. 

If 1+ N(P) — 1, then ; ‘¥,.(PA) depends only on three parameters, namely, 
r, and I”, where I” || p/” — 1 and r is the order of ρ΄ modulo |. To emphasize 
this, we set ,‘P,(PA) = C(I5, o(u), p’), where C is computed according to 
Rules 1-5 in Mann and Vélez [2]. 

From the above comments we have the following: If (q, /) = 1, then 


C(I, q ο(μ), p’) = C(F, o(u), p’). (1) 
If (gq, 1) = 1 and /| p’ — 1, then 
C(I, ο(μ), ν΄) = C(I, ο(μ), p’) (2) 


since if I* || ρ΄ — 1, k > 0, then I* || p/ — 1. 
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117 (p’ — 1), ris the order of ρ΄ modulo | and (q, Ir) = 1, then 
C(I, o(u), Pp’) = C(F, ο(μ), ν΄) (3) 


On using Theorem 4 from Mann and Vélez [2], we can also show, by 
induction, that ,'P,,(P) depends only on μι, N(P) = p’, and the order of μ 
modulo Y. Hence, we set - (A) = C(m, ο(μ), ρ΄). 

If (gq, m) = 1, then we have 


C(m, q ο(μ), ρ΄) = C(m, ο(μ), p’). (4) 
Let m= q°!-+- 42 151 5. Ie", where the q;, 1, are distinct primes, q;|p’ — 1, 
|, + p! — 1 and γ᾽ is the order of ρ΄ modulo ἰ;. If (4, 4) = (4, r;/;) = 1, for all 
i, j, then 


C(m, ο(μ), ν΄) = C(m, o(u), ν΄). (5) 
Given x"—p and N(P)=p’, write 
m = 441... gif) ++ If, where q;, 1; are dis- 
tinct primes, q;|p’ — 1,1; ρ΄ — 1, r;is the 
order of ρ΄ modulo I,, and 4} p’ — 1. 
(6) 

Lemma2 Let x"—yp be irreducible over F, (mp, P)=1, and 
rE (ΦῚ = Σὲ GL Si], then f,|mR,, where R, = | [f=1 ;.for each i. 

Proof We shall prove this by induction on n. Assume that n = 0 in (6) 
above. Then by Theorem 5 in Mann and Vélez [2], we have that ; V,,(P) = 
[]fe1 rVYoe(P). But, from Rules 1-5 in Mann and Vélez [2], the theorem 
holds for each -'P,.(P), hence it holds for m. 

Assume that the assertion is true for n=k and m= q{' "ig ae 
Ket = m, Gti. Let FE κι (A) = Σ᾽ gf fi], then by induction, fll, R,), for 
each ἢ, that appears in the sum ; ¥,, AP). 

Let P, be an ideal factor of 99 in F, = F(u'/"') of relative degree f;, for 
each i. Then by Theorem 4 in [2], we have that 


ΞΞΞ (δ᾽ σφ ἢ) Fi Ph. tt? (P;). 
But if -, ‘Py, , «+s ae σι πη. then fi;], it} - r,4 by Rules 1-5 in Mann 
and Vélez [2], ts ee j. Hence i ee So {ἢ ἈΠ] ΙΕ ει. 
However, f, f;; is an arbitrary term ἴῃ; Ψ,, (9), so the induction 15 
complete. ἢ 


Lemma 3 Let x" — μ be irreducible over F, (mp, 7) =1, m=m,F, 
(m,, 1) = 1, La prime, and Ψ, (9) = >. gil fi). fl |p? — 1 and (1, f;) = 1, for 
all i, then ;V (Φ) = F Ψ, (9. pV dP). If Ly p! -- Land (rl, f;) = 1, for alli, 
then -VlFP) = ¢ Em (Φ)- ¢ Vid). 

Proof Let F, = F(u''™) and Y; a prime factor of F in F, of relative 
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degree f;, for each i. Then by Theorem 4 in Mann and Vélez [2], we have that 
rE (9) = Σὲ 9A file, Vi(P;). But, εὐ Ῥ fF) = C(I ο(μ᾽ ""), pt). By 
Lemma 1, if ["|/o(u), then [|/o(u’’"™) since (m,,1)=1. Hence 
Ο(Γ, o(u™), pot) = C(I, ο(μ), »" 

If 1] p? — 1 and I” || p’ — 1, then || p/" — 1, since (f;, J) = 1. 

If! + p’ — land I” || p/” — 1, then r is the order of ρ΄! and I” | ρ77:" — 1, 
since (f;, rl) = 1. 

Hence C(I, o(u), ρ΄) = C(I, ο(μ), ν΄) = p¥,(P). So we have that for 
each i, p, Pi(P;) = p ¥i(P). Hence -¥,(P) = Ψ,, (Φ)-, Ψ,(Φ) ἢ 


Theorem 1 Let x"— 4 be irreducible over F, m=m,l, l a prime, 
(mu, P) = (m,, l) = 1, and | is the largest divisor of m such that 1} ρ΄ — 1, 
then 


PY AP) = FE m (FP) + C(m, o(u), ΡΨ, (ΦῚ — 1[1]). 


Proof Let m= 41' 55" gsi: =m, Vf, where I = It Since r,|1;— 1, 
we have that r; < 1; < l, i <n, hence (J, R,_,) = 1. Hence (1, m, R,~,) = 1, 
where R,-, = | [721 τι. 

Let F, = Ε(μ᾽ ἢ, then Ψ(Φ) = 1[1] + ([-- 1)/r[r]. If c = 1, then 


FE AP) = Ψ, (P) + ([-- 1)/r[r] - C(m,, ο(μ), ””) 
=  Ψ, (9) + C(m, ο(μ), P’")(¥(P) — 1[1)). 


Hence, the theorem is true for c = 1. Assume it is true for c = k and let 
πὶ τ μι. τὶ Then 


FE (72) = Ψ,,μι{99} + ([ -- U/r[r] - C(m, Γ΄, 1. ο(μ), ρ΄"). 


By Lemma 2, if C(m,,1- ο(μ), ν΄) Ξε Σ᾽ σίί 7], then f,|m,R,—1. Hence 
(f;, 1) = 1, so by Lemma 3 we have that 


C(m, BP, 1 - o(u), p") = C(my, o(u), p/)C(, 1 - ο(μ), Pp”). 
Hence 
FE m(P) = Ψ, (9) + C(my, ο(μ), »"}(, Ῥμ(9) — 1[1]) 
+ (I 1/r[rJC(m, ο(μ), ρ΄" (, Ι- ο(μ), p’”) 
= Pn (9) + C(m, ο(μ), ν΄) 
(rp Pu(P) + (Ι -- 1)/r[r]C(, 1 - ο(μ), p’”) — 1[1]) 
= Ψ, (9) + C(m, ο(μ), Pr n+ (FP) — 11), 
by Lemma 3 of [2]. So the induction is complete. J 


Theorem 1 gives a recursive procedure for writing ,'¥,,(P) in terms of 
the counting functions of the divisors of m. That is, if m = 441 ““- q@IS! --- 
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ln = m, Ii", as in (6), where |, < 1, «:- <1,, then by Theorem | 

pV m(P) = Ψ, (P) + C(my, o(u), »΄")(» Pije(P) — 11]}ἅ 
Note, however, that ,'Y,,,(P) has exactly n — 1 primes dividing m and not 
dividing p’ — 1, C(m,, o(u), ΡΠ") has no more than ἢ — 1 primes dividing m 
and not dividing ρ΄" — 1, while ; Ψ,, (99) has exactly n primes dividing m and 
not dividing ρ΄ — 1. Hence, this procedure can now be applied to ,'P,,, (P) 
and C(m,, ο(μ), p’”). 


Example 1 If n = 1, then 
Ψ, (9) = FP mn (9) + C(m, ο(μ), Pr Ψ, {Φ) — 1Π]} 


However, by Theorem 5 in Mann and Vélez [2], we have that 
= [] e¥al) + [ΠΟ οὐ. 0/9) (aA) ~ 10D. 
50,, ΨῬ,, (99) is expressed in terms of the known counting functions of its 
prime divisors. 
Example 2 m= qi! °°: φο 15} = m, (113, 1, < 1,, then 
FV AP) = FP mite (P) Ὁ C(m, 1's ο(μ), ΡΠ Ψι,(Φ) _ 1[1]} 
= En (FP) + C(m, ο(μ), PrP i,(F) — 11]) 
+ C(m, πὶ, o(u), plt2)(p (PF) — 1[1]). 
Now, the first two terms can be factored as in Example 1. We consider two 


cases: (1) 1, | p’"? — 1, (2) 1, 7. p/"? — 1 and the order of p/"? modulo 1, is r', 
then in case 1, 


C(m, ', o(u), p’"?) = Tet qi’, o(u), ρ΄") - C(I, o(u), po”), 


by Theorem 5 in Mann and Vélez [2]. 
In case 2, we must apply Theorem 1 and we have 
C(m, ', o(u), p’”?) = C(m,, o(u), Pp”) 


+ C(m,, o(u), p77 )(C(I, o(u), ρ΄) — 11). καὶ 
Lemma 4 Let x" — μ be irreducible over F, (lu, P) = 1, F|| py — 1, 
k > 1, and I" = (ο(μ), I), then - ¥.(P) = [1] iffe<k —n. | 
Proof The lemma is proved by referring to Rules 1-4 in Mann and 
Vélez [2]. If Rule 1 applies, then 


pr P) ΣΕ: ΓΒ ee) ἘΞ [{1] iff c<k—n. 


Rule 2 cannot apply here since k = 1 = n. If Rules 3 or 4 apply, then n = 0 
and the assertion is obvious. J 
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Lemma 5 [fd is an integer in F, (A, A) = 1, then for each m there is an 
irreducible polynomial x™ — yt such that p = A (mod FY). 


Proof Let # bea prime ideal in F such that (7’, A) = 1. Then, we can 
find an ideal . such that (./, AP’) = 1 and PW = (π) (see Mann [], p. 
50]). Hence ’ || x. Since (?’, A) = 1, we can solve the system of congruences 
x = A/nx (mod FY), x = 1 (mod FY’). Let β be such a solution and set μ = βπ. 
Then p = A (mod Y). Furthermore, since F || u, we have that x” — p is an 
Eisenstein polynomial, and hence, x” — μ 15 irreducible for all m. ὶ 


In Lemma 3 we had the situation where ;'V,, (92) = - Um (FP) * er Ψ, (2). 
The next theorem, in fact, characterizes this phenomenon. 


Theorem 2 Let m=m,I, (m,, 1) = 1, x" — p irreducible over F, then 
rE nl(F) =  Ψ,, (P) * Ψ,(Φ), for all such m, iff 1ΠΡ’ -- 1 and c<k—n, 
where k and ἢ are as in Lemma 4. 

Proof Assume first that |p’ — 1 andc <k —n. Let F, = F(u*"), then 
by Lemma 4, we have that - Ψ,(Φ») = [{1]. Hence, if , is any factor of 99 in 
F,, then 


rE (Φ) = be, Bm (Pi) = PC(m, o(u""), ρ΄") = FC(m,, ο(μ), Pp’), 


since (1, m,) = 1. Hence , ¥,,(P) = |g Pn (FP) = ¢ LidlA)e Uni (FP). 

Next, assume that , Ψ,,(Φ) = pV (PF) > Ψ, (2), for all my, (m,, 1) = 1. 
Assume first that | ρ΄ — 1 and let r be the order of p! modulo I. Let |, bea 
prime such that 1, {ρ΄ — 1 and (r,,7r) #1, where r, is the order of ρ΄ 
modulo /,. By assumption, 


pV (PF) = ¢ PAP) - rl (A) = » ῬΙ(ΦΊ(Π] + Ἦ (ἢ — 1)/rilri]). 


Now, let F, = F(u'/"") and A, a prime factor οἵ 99 in F, of degree 1, P, a 
prime factor of A in F, of degree r,, then 


pV (P) = ry ¥(Po) + (hi -- 1)/ri[ri] - CL o(u), pi") 
= ΨΙ(Φ) + (4 -- 1) {π|]6(!, o(u), pi") 
= pV (FP) ‘ rv, (P) = pv (FP) + (1, —,1)/r,[r,] ᾿ pV (P). 
Hence, - P(A) = C(I, ο(μ), ρ΄"), but this is impossible since (r, r,) # 1. So, 
if -Y(P) = Ψ,, (Φ) * VAP), for all m, prime to J, then [| p/ — 1. Since 
ΠΡ’ — 1, let || p’ — 1 and choose k, so that Κὶ > k and c<k, —n. Set 
K =k, —k. Let 1, bea prime so that (1,, ρ΄ — 1) = 1 and [*|r,, where r, is 


the order of ρ΄ modulo I,. By Lemma 5, we know that there exists an 
irreducible polynomial x"! — A, where 4 =p (mod F). So ο(λ) = ο(μ). 
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Hence 


r Pia (P) = Ψ,(Φ) + ({ — Y/rilrsJC(E, o(u), Pp’). 
Since [*|r, and If | ρ΄" — 1, we have that /*'| p/"! — 1. But c < k, — n, hence 
by Lemma 4, we have that C(F, o(u), ρ΄") = I*[1]. Hence 


FP a (P) = Ψ, (9) + (Lh, — 1)/r,[r] . [11] 
- Ψ.(Φ) Pi (Φ) = Ψ, (9) (ἢ -- UW/rilrile VAP), 


5.0. P,{P) = [{1], hence c<k—n, by Lemma 4, and the theorem is 
proven. § 


Theorem 3 Let x” — μ be irreducible over F and m is written as in (6), 
then 


rE WP) = I FPG (ὦ » Il PY cP ) 
i= j= 
if (a) (r;, rj) = (r;, Lj) = 1, and (Ὁ) for each i, either (q;, r;) = lore; <k;—n,;, 
for each j, where qj! || o(u) modulo F. 

Proof Assume that (r;,r;) = (γι. /;) = 1 and (q;, γι) = 1 or 6; <kj — nj. 
We shall induct on the number of prime factors of m. If m is a power of a 
prime, then we are done. Hence assume that m has k + 1 distinct prime 
factors and the theorem is true for k prime factors. If δ; < Κι — n,, then by 
Lemma 4 we have that Ψ, (9) = Ψ, (FP) - Pg e(P), where m, = μι 47}. 
Then induction applies to ,;'P,,,(A) and the theorem is proven. Hence, we 
may assume that e, > Κι — n,, for all 7. Then we must have that for each i and 
j, (γι. 4;) = 1. If n = 0, then we are done by Theorem 5 in Mann and Vélez 
[2]. Hence, assume that n # 0. Without loss of generality, we may assume 
that |, >1;, i< n. Then, by Theorem 1, we have that 


rE (FP) = ¢ Ym (9) + Cm, o(u), pi") p Yen P) — 1{Π1]}. 


where m, = m/l". But (r,, 7;/;) = 1 = (r,. q,), for all j, i< n. Hence, by (5), 
we have that 


C(m, ο(μ), ΡΠ") Ὡς C(m,, o(), ΡΠ τν ΕῬ,, (9), 


80, Ψ,, (92) = Ψ,.,(92})᾿ eV n(P), So, we can apply the induction hypoth- 
esis to ; V,,,(P), and the induction is complete. J 


Conjecture We conjecture that if 


Vm) = ΠῚ ¥en) ΠῚ 2 Pi AP) 


the numerical conditions, (a) and (b), in Theorem 3 hold, that is, that 
Theorem 3 is actually a characterization of this phenomena. 
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Rational Quadratic 
Forms and Orthogonal Designs 


WARREN WOLFE? 


QUEEN’S UNIVERSITY 
KINGSTON, ONTARIO, CANADAT 


Orthogonal designs were introduced as a generalization of Hadamard arrays and their existence 
has been studied by several authors. This work has led to some intricate and elaborate methods and 
results, but, to date, no general method has been developed. In this paper we claim to isolate the 
algebraic properties of orthogonal designs and show that the existence of these designs is dependent 
upon the equivalence of quadratic forms. We give strong necessary conditions for the existence of 
orthogonal designs in orders 2’ - no, ng odd, r = 0, 1, 2, or 3. 


Definition An orthogonal design in order n and of type (uy, ..., u,) on the 
commuting variables x,, ..., X, iS an n X n matrix X with entries from {0, 
+X ,..., +x,} such that ΧΧ' - (δ u;x?)I,. 


Alternatively, each row of X contains u, entries of the type +x, and the 
rows are formally orthogonal. Such matrices were first described in Gera- 
mita et al. [2] and have been very useful for determining the existence of 


+ This paper is based on the author’s doctoral dissertation written at Queen’s University, 
Kingston, Ontario, Canada, under the supervision of Dr. A. V. Geramita. 

1 Present address: Department of Mathematics, Royal Roads Military College, Victoria, 
British Columbia, Canada. 
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weighing matrices and, in particular, Hadamard matrices. (See Geramita 
et al. [2], Geramita and Wallis [3] or Germaita and Wallis [3a].) 

If X is as above, we may write X = Σὲ A;x; where the A, are the 
coefficient matrices. Then the A; are 0, +1 matrices of order n such that: 


(i) the A; are disjoint, ie, if A;= (aq), A; ΞΞ (δ 4) (i Ξ ἢ), then 
a, 54, = Ὁ for every k, 1; 
(ii) A,;At=u,I,,1<i<s; 

(iti) A; A, + A,;Aj}=0,i 4). 

It has been demonstrated that each of the above properties sets a strong 
criteria for the existence of an orthogonal design. In this paper, however, we 
shall examine only the algebraic properties, namely (ii) and (iii). For this 
purpose we make the following definition: 


Definition A rational family in order n and of type μι. ..., u,] is a family 
of n x n rational matrices {A,, ..., A,} such that (ii) and (iii) above are 
satisfied. 


Surely a rational family reflects the algebraic properties of an orthogonal 
design, and the existence of an orthogonal design will imply the existence of 
a rational family in the same order and of the same type. 

If {A;, ..., A,} is a rational family as above, then: 


(i) {a,A;, 1 <i <-s}, where a; € Q isa rational family in order n and of 
type [αὐ uy, ..., asu,]; 
(ii) {A, + Az, A3,..., A,} is a rational family in order n and of type 
[uy + Uz, Uz,..., μι}; | 
(iii) if P and Q are n xn rational matrices such that PP‘ = pI, and 
ΟΟ' = ql, then {PA;Q, 1 <i <s} is a rational family in order n and of type 
[pur 4, ..., pusq]. 


Thus, one may “translate” a rational family in the same order and so 
change the type numbers to a certain extent. 

The first theorem for rational families limits the number of matrices in 
such a family and was first noted in Geramita and Pullman [1] and later 
described in terms of orthogonal designs. 


Theorem 1 (Radon-Geramita-Pullman) There exists ἃ rational 
family in order n with s members if and only if s < p(n) (the Radon function), 
where, ifn = 2477" - ng, no odd, 0 < b < 4, then p(n) = 8a + 2°. 


Necessity in the above theorem is proved using Radon’s result on the 
maximal number of anticommuting, skew-symmetric, orthogonal real 
matrices in order n (see Radon [6]). Sufficiency is given by Geramita and 
Pullman [1], who have shown that, for a given n, there exists a rational 
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family in order n with p(n) members and of type [1, 1, ..., 1]. In fact, the 
matrices in their rational families are disjoint 0, +1 matrices. 

The Radon function is easily computable; for example, p(2n,) = 2, 
p(4no) = 4, p(8no) = 8, p(16ng) = 9, ..., where no is odd. 

Further existence theorems for orthogonal designs have been found in- 
volving type numbers. In certain orders, Geramita and Wallis have shown 
that these type numbers were closely related to sums of squares (see 
Geramita et al. [2] and Geramita and Wallis [3].) We shall show that, more 
precisely, rational families (and hence orthogonal designs) rely on the equi- 
valence of rational quadratic forms. We refer the reader to Serre [7] for 
particulars and shall establish notation and definitions. 

Let f be an n-dimensional quadratic form over Q, and let M, be the 
uniquely determined symmetric nxn rational matrix such _ that 
f(X) = X'M,X. 

Two quadratic forms f and g are said to be equivalent if there exists an 
invertible matrix P such that M, = PM, Ρ'. We write f ~ g to denote this 
equivalence. Every quadratic form f is equivalent to a diagonal form, 1.6., 
there exists an invertible matrix P such that PM, Ρ' is the diagonal matrix 
dg(a,,..., a,) for some a; Ε Q. We denote this diagonal form as <a,,...,a,>. 
Let π΄’ (1) denote the form given by the identity matrix I,. 

If-f = Cay, ..., a,>, we let 6(f ) be the discriminant of f, i.e., the image of 
[ΠῚ α; in the group Q*/Q*?. 

Let p be a prime and let Q, denote the field of p-adic numbers. If a, b are 
in Q,, the p-adic Hilbert symbol of a and b is (note a, b nonzero) 


If f = Cay, ..., a,> 18 a rational quadratic form, then the p-adic Hasse invar- 
iant of fis s,(f) = | ]i<; (a, a;)p- 

We can now state the main theorem on the classification of rational 
quadratic forms. 


Theorem 2 (weak Hasse-Minkowski) f and g are equivalent rational 
quadratic forms if and only if they have the same dimension, the same discrimin- 
ant, the same Hasse invariant at each prime, and the same number of positive 
terms in equivalent diagonal forms. 


Theorem 3 If n= 1, 2, 4, or 8, then there exists a rational family in order 
n and of type [u,, ..., u,] if and only ifn: <1) = (uy, ..., u,>. 

Proof If {A;, 1 <i <n} isa rational family in order n and of type [u, 
..., U,], let v; be the vector in Q" that corresponds to the first row of A;. Then, 
by the properties of a rational family, the v, form an orthogonal basis for Q” 
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where the norm of »; (in the usual Euclidean sense) is u;. Let P be the n x n 
matrix whose ith row is v;. Then PP' = dg(u,, ..., u,) and, consequently, 


ney © ζῆ) νοῦ ΟΝ, 


Conversely, ifn - (1) ~ <u,,..., u,>, then there exists an invertible n x n 
matrix P = (p,;) such that PI, Ρ' = dg(u,, ..., u,). Let {B;, 1 <i < πὶ be the 
Geramita—Pullman rational family in order n and of type [1, ..., 1]. (Note 


that in Theorem 1, p(n) = n if and only if n = 1, 2, 4, or 8.) (See Taussky [9] 
for a discussion of the 1, 2, 4, or 8 problem.) 

Let A; = δῆς p,;B,;. Then it is a straightforward verification to show 
that {A;, 1 <i <n} 15 ἃ rational family in order n and of type [u,, ..., u,].- 


The force of the above theorem is realized when one uses the following 
reduction theorem proved by Shapiro using his classification of similarities 
of quadratic forms. 


Theorem 4 (Shapiro, [8] Jf n=2"-19, ng odd, then there exists a 
rational family in order n if and only if there exists a rational family in order 2” 
of the same type. 


In [8], Shapiro proves the above theorem for m = 1, 2, or 3. In a recent 
private communication, he has proved the general result. 

We may now apply the above theorems to some particular cases of 
rational families. 


Theorem 5 (i) If nis odd, then there exists a rational family in order n 
and of types [a] if and only if a is a square. 

(ii) Ifn=2 (mod 4), then there exists a rational family in order n and of 
type [a, b] if and only if ab is a square and a and b are both sums of two rational 
squares. 


We note that the above results were already known in the context of 
weighing matrices and orthogonal designs (see Geramita et al. [2]). 

Proof (i) follows directly by applying Theorems 3 and 4 and equating 
the discriminants of the forms <a> and <1). 

(ii) Again by Theorems 3 and 4, we need only establish when <a, b> = 
<1, 1). Using Hasse-Minkowski and some standard Hilbert symbol argu- 
ments, one can show that this equivalence exists if and only if the stated 
conditions on a and b hold. 


Theorem 6 If n= 4 (mod 8), then there exists a rational family in order 
n and of type: 


(i) [α, b, c, d] if and only if abcd is a square and, at every prime p, s,(<a, 
b, c, d>) = 1; 
(ii) [a, b, c) if and only if, at every prime p, s,(<a, b, c>)(—1, abc), = 1; 
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(111) [a, b] if and only if ab is a sum of three squares. 

(iv) [a] always. 

Proof (i) follows directly from Theorems 2-4. 

(ii) By Theorem 4, we may reduce to order 4. We have seen that the 
existence of the rational family is equivalent to the existence of a 4 x 4 
matrix P such that PP' = dg(a, b, c, x) for some x € Q. In terms of quadratic 
forms, 4<1) ~ <a, b, c, x>. By equating discriminants, one sees that x must 
be abc, and the condition of (ii) will follow by equating the Hasse invariants. 

(iii) was originally shown by Geramita and Wallis using a Pfaffian argu- 
ment [3]. However, as in (ii), one may see that the existence of the rational 
family is equivalent to the existence of a rational number x such that 
4-1) ~ <a, b, x, abx>. Using standard arguments, one can show that such 
an x exists if and only if ab is a sum of three squares. 

(iv) follows since, for any positive a, 4. (1. = <a, a, a, a). 


Theorem 7 If n= 8 (mod 16), then there exists a rational family in order 
n and: 


(i) with five or fewer members always; 
(ii) of type [a,,..., ag] if and only if | |} a; is a square and, at every prime 
P, Sp(Cay, ---, Ag>) = 1; 
(iii) of type [a,, ..., a7] if and only if, at every prime p, 


«((αν, -.-147>)(—1, ΠῚ αὐ, = 1; 


(iv) of type [a,, ..., ag] where a,a,a3a4 is a square and s,(<a,, ..., 
a,>) = 1 if and only if ας ας is a sum of three squares. 


Proof Clearly (ii) and (iii) are the analogous of (i) and (ii) in Theorem 6 
and are proven similarly. 

In (iv), the additional hypothesis on a,, ..., a, merely states that a 
rational family of type [a,, ..., a,] exists in order 4. If a; ας is a sum of three 
squares, then a rational family of type [ας ας] exists in order 4. By tensoring 
the members of one family with J, and of the other with [_? 4], one then 
constructs the required rational family in order 8. The converse argument is 
similar to that used for (iii) in Theorem 6. 

(i) requires a slightly different technique and we thank Shapiro for shar- 
ing a critical observation. If a, b, c, d, and e are any positive rational num- 
bers, then let ὃ = discriminant<a, b, c, d, e>. Since every positive rational 
number is represented by f= <a, b, c, d, e> [7, p. 37], 6 is represented by f 
and, hence f ~ <6, x, y, z, xyz> for some x, y, z in Q [7, p. 33]. A routine 
check will show that 8 - <1) ~ <0, x, y, z, xyz, ὄχγ, 6xz, Oyz> and, hence, for 
some quadratic form g, 8 - <1) =~ f 1 g. The conclusion of (i) now follows. 


The only question left unanswered in orders 8 - ng, where ng 15 odd, is 
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what happens in (iv) of Theorem 8 if one removes the additional hypothesis. 
That is, if a,, ..., ας are positive rational numbers of which no four satisfy 
completely the added hypothesis, does there exist a rational family in order 8 
and of type [a,, ..., ag]? 

Also of interest are the orders divisible by 2’, where r > 3. In personal 
communication, Shapiro indicates that he has proved the following result: 


Theorem 8 (Shapiro) If n= 16 (mod 32), then there exists a rational 
family in order n and of type (ay, ..., a9)-if and only if at every prime p, s,(a1, 
“669 ag) = 1. 


This result is the first of its kind in orders 16 - no, if ng is odd, and will 
eliminate many 9-tuples as possible types of rational families in these orders. 
Shapiro has shown that a relationship between rational families and quadra- 
tic forms does indeed exist in every order. His result now gives a complete 
solution of the problem of existence of rational families in every order. (We 
indicate, in an appendix, the orthogonal designs in order 16 on nine var- 
iables whose existence is still in doubt.) 

We have noted that an orthogonal design gives rise to a rational family 
in the same order and of the same type. Hence all necessary conditions for 
the existence of rational families are necessary conditions for the existence of 
orthogonal designs. In the appendices to this paper we give some indication 
of how these results help determine the existence of orthogonal designs in 
some particular orders. 

The sufficiency of the conditions above for rational families would indi- 
cate that our approach has exhausted the algebraic properties of orthogonal 
designs. However, there are also combinatorial results that prohibit the 
existence of orthogonal designs of certain types and orders, and we refer the 
reader to a sequence of papers which have recently appeared by Geramita, 
Wallis, et al. For the purpose of this paper, we shall note one such result and 
show how it may be blended with the above theorems to achieve a more 
complete picture. 


Theorem 9 (Geramita-Verner; see [4]) Jf n =0 (mod 4), then there 
exists an orthogonal design in order n and of type (uy, ..., Us) where 
ΣῈ uj = n— 1 if and only if there exists an orthogonal design in order n and of 


type (1, uy, ..., u,). 
We couple this theorem with Theorem 6 and obtain the following result: 


Theorem 10 If n= 4 (mod 8) and if there exists an orthogonal design in 
order n and of type: 


(i) (a, b, c) where a+ b+c=n-— 1, then abc is a square and, at every 
prime p, s,(<a, b, c>) = 1; 
(ii) (a, δ) where a+ b =n — 1, then at every prime p, (a, δὴ, = 1. 
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Similarly, combining Theorems 7 and 9, we get 


Theorem 11 Ifn = 8 (mod 16) and if there exists an orthogonal design in 
order n and of type: 


(i) (a,, ..., a7) where δ᾽ a; =n— 1, then {1 a; is a square and, at 


every prime p, 5ρί(α,, ..., 27>) = 1; 
(ii) (a,,..., a¢)' where δὶ a; =n — 1, then, at every prime p, 


(= Πα]: ΕΓ 


(iii) (a,, ..., a5) where [|{ a; is a square, δ᾽Ὶ αἱ Ξε ἡ -- 1, and, at every 
prime p, s s (Cay, νον 44>) = 1, then ας is a sum of three squares; 

(iv) ae a2) αῷ where IL a; is a square, δῚ a; =n — 1, and, at every 
prime p, s.(<a,, 42, a3>) = 1, then agas is a sum of three squares. 


Examples of tuples which are eliminated by Theorems 10 and 11 as types 
of orthogonal designs are easy to find. For example, in order 20, the tuple 
(1, 2, 16) certainly satisfies condition (ii) in Theorem 6, but does not meet 
the condition (i) of Theorem 10. Similarly, the tuple (1, 1, 1, 1, 1, 1, 17) is not 
the type of an orthogonal design in order 24 since it fails to satisfy condition 
(i) of Theorem 11. 


Appendix I 


In this appendix we wish to update the status of known orthogonal 
designs in order 20. We use as a base the list given by Geramita and Verner 
[4]. 

The following 4-tuples are easily eliminated using Theorem 6 (i): (1, 1, 2, 
16), (1, 1, 5, 8), (1, 1, 5, 13), (1, 1, 8, 10), (1, 2, 6, 11), (1, 2, 2, 8), and (2, 3, 7, 8). 

We apply (ii) of the same theorem to the 3-tuples in doubt and find that 
the following tuples fail at the indicated prime: 


(1, 3, 9) ((, 2, 10) 

(1, 3, 10) (1, 5, 13) 

(1, 3, 16) (1, 8, τς 

(1, 4, 6) (5, 6,7 

(1, 6,13,5 at p=3 

(2, 5, 6) (1, 5, 8) 

(2, 6, 11 (2, 3, 8) | at p=2 
(3, 4, 10 (2, 7, 8) 

(3, 7, 10 


we 
pb 


we 


A en, 
Le 
ied 
— 

-" "- 


αἱ p=11 (2, 3, 13) at ρ-- 13 


eee ee (he Nee See” 
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Similarly, the tuples (1, 2, 16), (3, 16), and (6, 13) are eliminated by 
Theorem 10. 

J. Wallis reports in private communication that designs of type (3, 3, 6, 
6), (1, 3, 14), (3, 6, 8), (2, 5, 7), and (7, 10) have been found in order 20. Using 
Theorem 13 of Geramita et al. [2], we have found designs of the following 
types where we indicate the first rows of the circulant matrices used: 


type (1, 4, 5, 5): 
ad00Q —-d —dc00c cdb—bd Ocbb—c 
type (1, 2, 2, 9): 
ab00—b bbb0O cdb—bO d—c0b-—b 
type (1, 2, 8, 9): 
abc—c-—b bbbc—c cdb—be d—c —cb —b 
type (1, 4, 13): 
ac—cc-—cC Oc —c—c—-c Ocbb—-c bece —b 


At present, it is still unknown whether the following tuples are types of 
designs in order 20: 


Appendix II 


We should like to disclose the early status of orthogonal designs on four 
variables in order 28. Using Theorems 6 and 9 we can eliminate all 4-tuples 
except the 73 listed below (this list has been verified by an APL search on a 
Burroughs 6700 computer as programmed by J. Verner): 


(1, 1, 1, 1). (1,3,6,18) (2, 4,8, 9) 
((,1,1,4ν 4,4, 4) (2, 5, 5, 8)s 
(1, 1,1,9ν. (4449) (2,8, 8,8) 
(1, 1, 1, 16). (1,4,4,16) (2, 8,9, 9) 
(1,1,1,25. (14,5,5) (3, 3,3, 3) 
(1.1.2... (1, 4, 8, 8) (3, 3, 3, 12) 
(1, 1, 2, 8)ν (1, 4, 9, 9) (3, 3, 6, 6) 
(1, 1, 2, 18 (1,4, 10,10) (3, 4, 6, 8) 
((, 1, 4,4 ν (15,5,9 (β,6,8,9) 
(Ι, 1,4, 9)». 4{(,,8,8,9) (4, 4, 4, 4)ν 
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order 2; see construction 22 of [2]; 
(iii) Baumert—Hall array of type (7, 7, 7, 7); see Wallis et al. [10, p. 261]; 


The indicates those tuples that are the types of orthogonal designs 
(iv) construction 16 of [2]; 


constructed by: 
(ii) designs in order 14 and the amicable pair of types (1, 1); (1, 1) in 


Shapiro’s result in Theorem 8 is the first algebraic restriction on the type 


(i) direct sum of designs in orders 16 and 12; 
numbers of orthogonal designs in order 16. Below we list all 25 possible 


(v) Golay sequences; see [34]. 
types of designs on 9 variables in order 16 not eliminated by Theorems 8 and 
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((, 1,1,1,1,1,1,2, 7) (4, 1, 1,1,1,2, 3, 3, 3) 

(1, 1, 1, 1, 1, 1, 1, 3, 4) (1, 1, 1, 4, 2, 2, 2, 2, 2) 

(1, 1, 1, 1, 1, 1, 1, 3, 6) (1, 1, 1, 1, 2, 2, 2, 2, 4) 
(131,90, 9-9: 9-919) 7 


ν΄ indicates that a design of that type exists in order 16 (see [2]). 
Combining Theorems 8 and 9, one may also eliminate the following 
8-tuples as types of orthogonal designs in order 16. 


(1, 1, 1, 1, 1,3,3,4) (1, 1, 1, 2, 2, 2, 3, 3) 
(1, 1,1,1,2,2,2,5) (1, 1, 2, 2,2, 2, 2, 3) 
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On Finite Projective Planes 
of Lenz-Barlotti Class 13 


JILL C. ἢ. 5. YAQUB 


THE OHIO STATE UNIVERSITY 
COLUMBUS, OHIO 


We study the existence of finite planes of Lenz—-Barlotti class 13, in particular for orders n < 250. 
By extending a result of W. M. Kantor, we are able to exclude certain values of n that were not 
hitherto rejected. We show that n #9 by direct calculation, and list all n < 250 for which the 
existence question is undecided. We also consider finite planes of class 13 whose additive loop has 
both inverse properties ; in this case further orders can be excluded, in particular n = 25 and those ἢ 
with n — 1 = 3", 2(3"), 3p or 6p, where p is a prime > 3. 


While finite planes of Lenz—Barlotti class 14 have been studied exten- 
sively, not much is known about finite planes of the related class 13, even for 
small values of the order. (Planes of both types have recently been surveyed 
by Kantor [6, 7]. There is no known finite example of either class.) The 
purpose of the present paper is to give a slight extension of a theorem of 
Kantor [6, Theorem 6.3], and to investigate the existence of planes of class I3 
for specific values of the order n, where n < 250. Basic information concern- 
ing projective planes can be found in Dembowski [2], and Pickert [9]; we use 
the coordinate system of [9]. The principal results on finite planes of class 13 
are summarized in Theorems A and B. 


Theorem A_ Let x be a plane of class 13 with finite order n. Let the 
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coordinate quadrangle VUOE be chosen so that x is (V, OU) and (U, OV) 
transitive, and let R be the corresponding ternary ring. Then: 


(i) Ris linear, multiplication is associative, and R satisfies the distribu- 
tive law a(b + c) = ab + ac. The multiplicative group R* is not commutative, 
the additive loop (R, +) is not associative, and R does not satisfy the distribu- 
tive law (b + c)a = ba + ca. 

(ii) R* contains at most one involution. 

(iii) The Sylow 2-groups of R* are cyclic or generalized quaternion. 

(iv) If nis even and if p is a prime that divides n — 1, then there exist 
integers x, y, z, not all zero, such that nx* + (—1)?~)/*py? = z?. 

(v) Let G= R* x R*, let D= {(a, b)e Gl[b=a + 1), G, = {1} x R*, 
G, = R* x {1}, and G; = {(r, r)|r ε R*}. Then [Ὁ] =n— 2 and 


(Z4)( pat) =n+ Σ fae 0 et De eam a 


deD géeG geGi ge G2 geG3 


where the sums are taken in the rational group algebra QG of G. 

(vi) If R* admits a noncyclic abelian 2-group as a homomor phic image, 
then n is a square. 

(vii) n— 1 is not a power of 2 ifn #9. 


Proofs of the statements in (i) can be found in [9, §3.5]. Statement (ii) 
follows from the Ostrom-Liineburg theorem [2, p. 120], and (iii) follows 
from (ii) (Burnside [1, p. 132]). Statements (iv) and (v) are proved by Hughes 
([5, Theorem 3.2]; [4, §III. 1] respectively), and (vi), (vii) by Kantor [6, 
Theorem 6.3 and Corollary 6.4]. Note that only one distributive law is 
necessary for the proof of (v). 

If ae (R, +), let a+(—a) =0. Then (R, +) is said to have the right, 
(left), inverse property if and only if, for alla, b ε (R, +), (ὃ + α) - (--α) Ξε ὃ 
(a + (—a + b) =b, respectively). 


Theorem ΒΒ Let x be a plane of class 13 with finite order n, coordinatized 
as in Theorem A. Suppose that (R, +) has both inverse properties. Then: 
(ij) (—1)+1=0,(—1) = 1, and α(-- 1) = (—1)a= —aforallae R. 
(ii) (R, +) is commutative. 
(iii) Ifa? =1, thena=1or —1. 
(iv) Leta#1. Τῆθη α -" 1 -Ξ (-- 1)α2 «» αὐ =1. 
(v) Ifa? τ-- δ =1, ἴα, b Ξ 1, and if ab = ba, thenb τ ora’. 
(vi) The Sylow 3-groups of R* are cyclic. 
(vii) n— 143" or 2(3’). 
(viii) nis evens>1+1=0. 
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(ix) If 3 divides n, then 1+1=-—1; if 1+1=—1, thenn=1 or 
3 (mod 6). 

(x) n#0 (mod 6). 

(xi) n #2 (mod 4). 

(xii) n #15 or 21 (mod 24). 

To prove (i), observe that 1 = 1 + (—1 + 1) by the left inverse property, 
whence —1+1=0. Then (—1)? + (—1) =0, by the distributive law, so 
that (— 1)? = 1. Thus, by Theorem A(ii), a(— 1) = (—1)a for all a ε R, and 
a(—1) = —a by the distributive law. (This slight change in the proof of [6, 
Theorem 2.7(i)] is required because R now has only one distributive law.) 
Statements (ii}-(v) are proved in Kantor [6, Theorem 2.7, (ii}-(v)]. Note that 
the proof of Theorem 2.7(v) uses commutativity only in the subgroup <a, ὃ), 
and hence proves the present statement (v). (The statement of Theorem 
2.7(v), omitted in Kantor [6], can be found in Kantor [7].) Now (vi) follows 
from (v) (Burnside [1, p. 131]), and (vii) follows from (vi) because R* is a 
nonabelian group with at most one involution. Since |R*| = n — 1, (i) and 
(iii) imply (viii). (See also Hughes [4, p. 513].) Statements (ix), (x) are proved 
by Hughes [4, Theorem II.11], and (xi) follows from [6, Lemma 4.9] and a 
theorem of Hughes [2, p. 173, 12]. Finally, (xii) is proved in [7]. 

If a finite (V, OU), (U, OV) transitive plane z is of class > I3 and if R* is 
noncommutative, then z is either a “near-field plane” (of class [Va2 or IVa3), 
or a “dual near-field plane” (of class IVb2 or IVb3). In the former case π is 
(VU, VU) transitive and (R, +) is associative. In the latter case π is (0, 0) 
transitive, so that the coordinatization is not the standard one; here (R, +) 
has the right inverse property, but need not (and probably cannot) have the 
left inverse property. 

Theorem A(vi) is proved by applying a certain homomorphism to the 
identity in Theorem A(v). It is suggested in Kantor [6] that some similar 
argument might be used to show that R* cannot admit C, x C, as a homo- 
morphic image, where C, denotes a cyclic group of odd prime order p. This 
would be a major advance in the theory. Unfortunately the natural analo- 
gous procedure does not yield a result of such generality. However, we show 
in Theorem 1 and its corollary that the approach does furnish a weak 
numerical restriction, and that this has some modest application in exclud- 
ing non-prime-power orders for class 13. 


Theorem 1 Let x be a plane of class at least 13 and of finite order ἡ. 
Suppose that R* admits C, x C, as a homomorphic image, where C, is a group 
of odd prime order p. Then the Legendre symbol (q/p) = 1 for each odd prime q 
that divides the square-free part of n. Equivalently, there exist integers x, y, 2, 
not all zero, such that 


nx? + (—1)?— D/Apy? = 2?. 
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Proof Let x be of class at least [3 and of finite order n, coordinatized as 
in Theorem A. Let G=R*xR*, D={(a,b)eGlb=a+ 1), 
G, = {1} x R*, G, = R* x {1}, and G; = {(r,r)|re R*}. Let H = <a> x 
<B>, where a? = B? = 1 and a, B # 1. Suppose that there exists an epimor- 
phism φ: R* — H, with kernel K. Let A, B be the complete inverse images in 
R* of <B>, <a> respectively. Then there exist epimorphisms ¢,: R* > <a> 
and @,: R* — <B>, with kernels A and B, respectively. Define wy: G- H by 
wir, 5) = (¢,(r), φ.(5)) for each (r, 5) ε G. Then w is an epimorphism with 
kernel A x B, and w can be extended to an epimorphism VP: OG — QH, 
where QG, QH are the rational group algebras of G and H. 

Let αι; be the number of ἀ ε D such that W(d) = αἰ β', for i, j = 0, 1, ..., 
p — 1; then α;; is also the number of d € D such that (d~*) = «‘B-/. The 
restrictions of y to G,, G, clearly have kernels of order (n — 1)/p, while the 
restriction of Ψ to G; has kernel of order (n — 1)/p?, because A ἡ B= K. 
Hence, on applying to the identity in Theorem A(v), we find 


( > a,a'p')( oo dant *8*) =n+(n—1)*p? oy αἱ Bs 
k, m=0 


i, j=O i, j=0 


= (n= 1)" ph (n= typ? Da 


Ρ- 1 » ν 
—(n—1)p-? Σ api (1) 
i, j=0 
It follows that 
1 


(S aue'b)( Saunt 88") = "Sales sw) 
where 
u(0, 0) =n + (n — 1)2ρ 2 — 2(n— 1)p7! — (n— 1)ὴρ 2, 
p(0, t) = p(t, 0) = (n— 1) ρ΄ ὅ — (n— 1)p-* — (n— 1)p* 
if t+#0, (3) 


u(r, 5) ΞΞ (μα, -- 1)2ρ 2 -- (, --ἰ ἡ» 2 if. rs #0. 


Equating coefficients for the term a’f* in (2), we have 
p-1 
ur, s)= Σ ayarm+sdim for r,s=0,1,...,p—1, (4) 
k,m= 


where subscripts in (4) and hereafter are taken mod p. 


Let v(r, s)= )?26 w(r+j,s+j), for r,.s=0, 1, ..., p—1, and let 
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Υ = Pid a; i+, fort =0, 1,..., p— 1. Then, by (4), 


p-1 p-1 p-1 
v(r, 5) = Σ » every) Qkm = > | ee 


k,m=0 \j=0 k,m=90 
p-1 

= 3 Yo-r414e, +t 
k,t=0 


Hence »εῇ 
v(r,s)= >) ΚΞ (5) 
t=0 


Let M be the p x psymmetric circulant matrix with M;, = Y,,;-2,and let N 


1 


be the p ΧΡ matrix with Ν;; = v(i, j), for i, j = 0, 1,..., p — 1. Then, by (5), 
MMTE=N. (6) 


On calculating the v(i, j) from (3), we find N = nI + (n— 1)(n — 4)p- J, 
where I is the identity matrix and J is the matrix each of whose entries 15 one. 
Hence by Hughes [5, Lemma 3.1] N is positive definite, det N is the square 
of an integer, and for each odd prime gq, the Hasse invariant C,(N) = (n, — 1) 
(P~1)/2(n, p),, where (a, δ), denotes the Hilbert norm-residue symbol. It fol- 
lows from the properties of this symbol (see, e.g., [2, p. 18]), and from the 
quadratic reciprocity theorem that C,(N) = (q/p) if q divides the square-free 
part of n and that C,(N) = 1 otherwise. Now, by (6), the matrix N is ra- 
tionally congruent to the identity matrix. By the Hasse-Minkowski theorem 
(Hasse [3]) this is possible only if C,(N) = 1 for each odd prime q. Hence 
(q/p) = 1 for each odd prime g that divides the square-free part of n. Equiva- 
lently (cf. Hughes [5, Theorem 3.2]), the equation nx? + (— 1)?~"*py? = ἢ 
has a nonzero solution in integers. 

If n and p satisfy the conclusion of Theorem 1, the Minkowski—Hasse 
theorem ensures that there exists a rational matrix M that satisfies (6). One 
may still ask whether Eq. (1) has a solution in nonnegative integers. For 
n = 28 and p = 3 (which is the smallest case of interest in the context of 
Theorem 2 below), Eq. (1) has the solution dog = 2, do, = 1, ao, = 5, 
Arg = 3, Ay, = 4, 412 = 2, Ago = 3, an, = 4, 45} = 2. 

The conclusion of Theorem 1 is always satisfied if n is a prime power and 
p divides n— 1. When n is even, Theorem 1 gives less information than 
Theorem A(iv), and when n= 1 or 2 (mod 4), the values of n to which 
Theorem 1 can be applied may well be excluded by the Bruck—Ryser 
theorem [2, p. 144, 13]. However, Theorem 1 does imply the nonexistence of 
planes of class I3 with order n for infinitely many n = 3 (mod 4) that are not 
otherwise excluded. 


Corollary (a) There is no plane of class 13 with order n= 1+ 2p’, 
where p is an odd prime, if the square-free part of n is divisible by a prime q such 
that (q/p) = —1. 
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(Ὁ) There is no plane of class 13 with order n = 1 + 2p’ if p is a prime = 
7 (mod 12) and if r(p — 1)/3 # 1 (mod 3). 

Proof (a) If 7 is of class 13 with order n = 1 + 2p’, then |R*| = 2p’. 
By Theorem A(ii), R* = <a> x T, where o is the unique involution in R* 
and T is the Sylow p-group of R*. Since R* is nonabelian, T is nonabelian, 
whence r 2 3. It follows that T, hence also R*, admits C, x C, as a homo- 
morphic image (as is easily shown by induction on r). Now apply Theorem 


(b) Suppose that p=7 (mod 12) and that r(p — 1)/3 #1 (mod 3). 
Then 3 divides πη, but 37 does not because 1+2p"=1+ 
2[1 + (p — 1)!’ = 3 + 2r(p — 1) (mod 37). Since p = 3 (mod 4), the quadra- 
tic reciprocity theorem ensures that (3/p) = —(p/3) = —1. 


As was noted some time ago in Yaqub [11], it is not hard to show that 
most non-prime-powers < 100 cannot be the order of a plane of class [3. In 
Theorem 2 we list all orders n < 250 that cannot yet be excluded. Non- 
prime-powers are in bold-faced type, and values n for which there exists a 
(genuine) near-field of order n carry an asterisk; the remaining values are 
primes, except for 125 = 53, for which no near-field exists. (See, e.g., [2, pp. 
228-231].) 


Theorem 2 If there exists a plane of class 13 with order n < 250, then 
n = 13, 25*, 28, 29, 37, 41, 43, 45, 49*, 53, 61, 64*, 73, 76, 79, 81*, 85, 89, 97, 
101, 109, 111, 112, 113, 115, 117, 121*, 125, 127, 137, 145, 148, 149, 151, 153, 
163, 169*, 172, 173, 185, 187, 193, 196, 205, 211, 221, 223, 225, 229, 233, 235, 
241, 244, or 245. 

Proof The values n = 40, 56, 58, 82, 130, 156, 184, 202, 204, 220, 226, 
232, 248 are excluded by Theorem A(iv), the values n = 17, 65 by Theorem 
A(vi), and the value n = 55 by the corollary to Theorem 1. All other values 
except ἢ = 9 are excluded either by the Bruck—Ryser theorem or by the fact 
that every group of order n — 1 that contains at most one involution is 
abelian. 

Suppose n = 9. Then, by Theorem A(iii), R* is the quaternion group 
<a, b|a* = b* = 1, a? = b’, δ΄ αν = a *). Let z denote the unique involu- 
tion in R*. (Note that we cannot assume z = — 1.) Then R* = {1, z, a, b, za, 
zb, ab, ba}. The group Aut Κ᾿ is transitive on the elements of order 4, and the 
subgroup that fixes a is transitive on R* — <a>. We construct all nonisomor- 
phic loops (R, +) that are compatible with the distributive law, by consider- 
ing the successive possibilities for 1 + x as x runs through R*. Whenever 
possible, we use the existence of automorphisms of R* to reduce the number 
of cases. Thus, for example, if 1 + 1 is of order 4, we may assume without 
loss of generality that 1+ 1 =a. Then 1+2z+#1 or z, and 1 + 2 ¥ za be- 
Cause z + z = za; hence we can assume that 1 + z =O or ἢ. In this way we 
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find only the following six loops: 


(i) 14+1=0, 1+z=a, 1+a=b, 1+b=z, 1+2za=ab, 
1+ zb = za, 1+ ab=ba, 1+ ba= ΖΡ. 

(ji) 141=0, 1+z=a, 1l+a=b, 1+b=2b, 1+2a=ba, 
1+ zb=ab,1+ab=z2z,1+ba=2a. 

(iii) 1+1=z 1+z=0, I+a=b, 1+b=a, 1+2za=ba, 
1 + zb = ab, 1 + ab = zb, 1 + ba = Za. 

(iv) 1+1=z2z, 1+2z=0, 1+a=b, 1+b=ba, 1+2za=ab, 
1 + zb = za, 1+ ab=zb,1+ ba=a. | 

(Ὁ) 1+1=z, 1+z=0, 1l+a=b, 1+b=ab, 1+2za=ba, 
14+ zb=z2a,1+ab=a,1+ba= zb. 

(vi) 1+1=z 14+2z=0, t+a=b, 1+b=a, 1+2za=ab, 
1 + zb = ba, 1 + ab = za, 1+ ba = zb. 


The first four loops are nonplanar; this can easily be verified by finding 
r,x, γε Rwithr 4 1,x 4 y, and (r+ x) '(1+x)=(r+y) ‘(1+ y). (See, 
e.g., Yaqub [10, p. 377].) It follows that loops (v) and (vi) must correspond to 
the near-field plane of order 9 and its dual (of class [Va3 and class IVb3, 
respectively) since these planes must appear as solutions. Hence there is no 
plane of class I3 with order 9. 

Henceforth we shall assume that (R, +) has both inverse properties. 
Then Theorem B applies, and in particular (R, +) is commutative. The 
following lemma is essentially contained in Kantor [7], but we include the 
proof because cases (ii) and (iv) of part (b) do not arise there. (See also 
the proof of Hughes [4, Theorem [1.11]. 


Lemma _ Suppose that π is finite, of class at least 13, and that (R, +) has 
both inverse properties. Suppose also that a, b € R* and thata + b = —1. Let 
x be the set consisting of the distinct elements from among a, a~', b,b~ +, a~ Ὁ, 
b~ +a. Then: 

(a) a '+a'b=—-1=b''+5b"'a. 

(Ὁ) Either (i) [Σ] = 6, or (ii) Σ = {c, ε ἢ) where c #1, οὐ = 1, and 
c+c ' = —1, or (iii) & = {1} and 1 - 1 = —1, or (iv) 2 = {1, c, c*} where 
c#1,—-landi+c=—-1=c'4+c"l. 

(c) R* —{-—1} can be partitioned into sets of type (i)-(iv). If ais of order 
3, then a belongs to a set of type (ii). There is at most one set of type (iv). 

(d) ΠΝ is a subgroup of index 2 in R*, and if X is a member of the 
partition such that |X| = 6, then |N A Z| =2 or 6. 

Proof (a) If a,be R* and if a+b=-—1, then ας —1#b. By 
Theorem B(i), (— 1)? = 1 and χί-- 1) = (—1)x for all x ε R*. Thusa+b= 
~—1=-a= —1+(-—1)b, by the inverse property, = —1=a~' + a™'‘b by 
the distributive law. Similarly, δ 1 + b-'a = —1. 
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(Ὁ) Suppose that [Σ] <6. Then, without loss of generality, we may 
assume that a=a™', b, δ", a~'b, or Ρ΄ ‘a. If a=a™', then a= 1 since 
a+ —1; similarly, if a= δ ‘a, then b = 1. Now if a= b = 1, then Σ is of 
type (iii), and if a = 1 # ἢ (or if b = 1 $ a), then Σ is of type (iv), with c = ὃ 
(c = a, respectively). Again, if a = b # 1, then Σ is of type (iv), with c = a™'. 
Suppose next that a = b~1, with a + 1. Thena + a~' = —1 =a? = (—1)x 
(1 + a)=>1+ a= (—1)a?. Hence a is of order 3, by Theorem B(iv), and Σ is 
of type (ii). Suppose finally that a = a~'b with a # 1. Thena + a? = —1,so 
that again 1 + a= (—1)a? and Σ is of type (ii). 

(c) Clearly, R* — {—1} can be partitioned into sets Σ of type (i)-(iv). If 
a is of order 3, then a + α΄ = —1 by Theorem B(iv), whence a belongs to a 
set of type (ii). If © is of type (iv), then c = (—1)(1 + 1). Hence there is at 
most one such set. 

(d) Suppose |Z] = 6 and that N is of index 2 in R*. If a, δ € N, then 
ΙΝ © ΣΙ = 6. Otherwise, |N 7 Z| = 2 because N is of index 2. 


We shall call the above sets of type (i)-(iv) “X-sets”; a Z-set is “proper” if 
and only if |x| = 6. 


Theorem 3 Suppose that x is of class 13 with order n, and that (R, + ) has 
both inverse properties. 
(i) If 1+ 1 is of order 4, then x contains a subplane of order 5. 
(ii) If 1+ 1 is of order 3, then x contains a subplane of order 7. 
(iii) 1+ 1 is not of order 6. 
(iv) Ifn—1= 4p, where p is a prime 2 3, then 1 + 1 is of order 4. 
(v) If pis a prime > 3, then ἢ -- 1 # 6p. 
(vi) If nis even and if R* = a, N), where αὖ = 1 and N is a normal 
subgroup of index 3 in R*, then a cannot act fixed point free on N. 
(vii) If pis a prime > 3, thenn — 1 3p. 
(viii) If p is an odd prime = —1 (mod 3), then n — 1 # 3p’. 


Proof (i) Suppose that 1 + 1 = a, where a has order 4. Then n is odd, 
and a? = —1, Ξ 1 by Theorem B(viii). Using the inverse properties and the 
commutativity of (R, +), we find 1+ a=a™‘andi+a™~* = —1. Hence the 
set {0} ὦ <a) is closed under addition and multiplication, and must there- 
fore coordinatize a subplane of order 5. 

(ii) Suppose that 1+ 1 = b, where b has order 3. Aas n is odd and 
—1 eo 1. is a = (—1)b, so that a has order 6 and b = a*. By Theorem Β(ν) 
1+a*=a* and 1+a?=<a. Also, since 1+ 1=a*, we have 1 -- αὖ -- 
1 = a* + αὐ, whence 1 + a= a? and 1 - α΄ =a’. Hence {0} ὦ <a) is closed 
under addition and multiplication, and must coordinatize a subplane of 
order 7. 

(iii) Suppose that 1 + 1 =a, where a has order 6. Then n is odd, and 
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a = (—1)b, where b has order 3. But then, by Theorem B(iv), 1 + b~* = 
(— 1)b, a contradiction since b + 1. 

(iv) Suppose that n — 1 = 4p, where p is a prime 2 3. Then, since R* is 
nonabelian and contains a unique involution, R* = <a, b|a* = 1 = D?, 
α΄ 'ba=b~'>. The subgroup N = <a’, b> is of index 2 in R*, and R* — N 
consists of all elements of R* that have order 4. ΠΣ is a proper L-set, then by 
the lemma, [Σ ἡ (R* — N)| = Ο ογ 4. Since |R* — N| = 2p =2 (mod 4), it 
follows that there exist elements c, c~' of order 4 which belong to a L-set Z, 
that is not proper. Clearly, XZ, is of type (iv). Hence 1 + 1 is of order 4. 

(v) Suppose n — 1 = 6p, where p is a prime > 3. Since R* contains the 
unique involution — 1 and since 3p is odd, R* = <—1) x S with [5] = 3p. 
Since p > 3, the Sylow p-group of S is normal, and since R* is nonabelian, it 
follows that p = 1 (mod 3) and that S has p Sylow 3-groups. Hence S con- 
tains p X-sets of type (ii). Let n,, n, be the numbers of elements in 5, R* < δ, 
respectively, which are contained in proper <-sets. If 1+ 1= —1, then 
n, = p— 1 and n, = 3p — 1 since R* — S contains no element of order 3. If 
(—1)1+1)=c#1, then nj, =p—3 and n,=3p—1 if ce 5, while 
n, = p— landn, = 3p — 3ifc ¢ 5. Hencen, <p — landn, 2 3p — 3. But, 
by the lemma, if Z is a proper X-set such that | ἡ (R* — S)| > 0, then Σ 
contains four elements of R* — S and two elements of 5. Hence n, < 2n,. 
This yields the contradiction 3p — 3 < 2(p — 1). 

(vi) Suppose that nis even and that R* = <a, N> where αὖ = 1 and N is 
a normal subgroup of index 3 in R*. We can assume that |N| > 1, since 
otherwise π is of order 4, hence desarguesian. Suppose that a acts fixed point 
free on N, ie., that a~’xa =x with xe N implies x = 1. Then [Ν|] = 
1 (mod 3), and <a) is a Sylow 3-group of R*. Now xax~!=awithxe N 
implies x = 1, and xax~'! = α΄ implies x?ax~ 2 = a, whence x = 1 because 
n is even. Hence R* contains | N| Sylow 3-groups, and every element of R* 
either belongs to N or has order 3. By Theorem B(iv) if c is of order 3, then 
l1+c=c'* since here —1=1. It follows that if xe N, x #1, then 
1 ἜΧΕΝ. Thus, again because 1 + 1 = 0, the set {0} ὦ N is closed under 
addition and multiplication, and must therefore coordinatize a subplane of 
order 1 + ΙΝ |. By Bruck’s theorem [2, p. 145, 18], this implies that 
(1+ |N|)? <14+3]|N|, ie, that | N| = 1, a contradiction. 

(vii) Suppose p is a prime > 3. If n — 1 = 3p, the Sylow p-group N of 
R* is normal and of index 3, and R* = <a, ΝΡ where a is any element of 
order 3. Since | N| = p and since a cannot act fixed point free on N by (vi), 
R* is abelian. 

(viii) Suppose p = —1 (mod 3) and that n — 1 = 3p?. Again, the Sylow 
p-group N of R* is normal and of index 3, and R* = <a, N> where a has 
order 3. The centralizer of a in N is nontrivial by (vi), and cannot be of order 
p since then a would act fixed point free on the p* — p remaining elements of 
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N, where p* — p = 2 (mod 3). Hence acentralizes N, and, since N is abelian, 
R* 1s abelian. 


Theorem 4 There is no plane π of class 13 and of order n such that (R, +) 
has both inverse properties for the following values of n < 250: n = 13, 25*, 28, 
29, 43, 45, 76, 79, 111, 112, 115, 117, 163, 187, 223, 244. 


Proof The values n = 45, 111, 117 are excluded by Theorem B(xii), and 
the values n = 28, 163, 244 by Theorem B(vii). If n = 43, 79, 115, 187, or 223 
we can apply Theorem 3(v). If n = 13 or 29, then by Theorem 3(iv) and (i), z 
contains a subplane of order 5; in each case this contradicts Bruck’s 
theorem. The values n = 76, 112 are excluded by Theorem 3(viii), (vii), re- 
spectively. Thus it remains to be shown that n # 25. 

If n = 25, then, since R* contains a unique involution, there are four 
nonisomorphic possibilities for R* [1, §126]. These are: 


_,  @ b, cl|a* = b* =1, a? =b?, αν -- α΄',, 8 =1, a ἴσα ΞΞ ς, 
i) (eu b cla = 6 = 1, a? =b’, δ΄ 'αθ τ- α'', οὐ =1, a ἶσα Ξε: 6, 
“: are me 1, a? =b?, b'ab=a™', οὖ =1, c'ac=b, 
ne pee =1=b°,a 'ba=b""). 


We note that the regular and irregular near-fields of order 25 have multi- 
plicative groups of type (iv) and (iii), respectively. 

Suppose first that R* is of type (i) or (ii). Then, in each case, R* contains 
exactly two elements of order 3, namely c and ο΄ | and the group 
N, = <a, c> is of index 2 in R*. Since |R* — N,| = 12 = 0 (mod 4), and 
since 1, —1, c, c 1 EN, it follows from the lemma that each L-set that 
contains an element of R* — N, is proper; there are three such sets, each 
containing two elements of N,. The remaining six elements of N, include 1, 
—1, c, and c™!. Since {c,c~'} is ἃ L-set of type (ii), we cannot have 
1+1= -—1. Hence N, contains a L-set of type (iv), say {1, d, d~*}, with 
1+d= —1, where also 1 + 1 = (—1)d. Now N, = <5, c) is also of index 2 
in R* for both types, and the same argument shows that {1, d,d~*} < Ν;. 
Since N, ὦ N,=<—1) x <c>, the element d is of order 3 or 6, whence 
1 + 1 is of order 6 or 3. By Theorem 3(iii), 1 + 1 is not of order 6. If 1 + 1 is 
of order 3, then by Theorem 3(ii), π contains a subplane of order 7; this 
contradicts Bruck’s theorem. Hence R* is not of type (i) or (ii). 

Suppose next that R* is of type (iii). Write z = a? = b? = —1. Then, 
besides 1 and z, R* contains six elements of order 4, eight elements of order 
3, and eight elements of order 6. The elements of order 3 form four X-sets of 
type (ii), and it follows from the lemma that there exists a X-set {1, d, d~ οἵ 
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type (iv), with 1 + 1 = zd. As has just been shown, d cannot be of order 3 or 
6. Hence 1 + 1 is of order 4, and we can assume without loss of generality 
that 1 + 1 =a. Weare left with four elements of order 4 and eight elements 
of order 6 with which to construct two proper L-sets, say X, and Σ2. The 
elements of order 4 cannot all be in the same -set. So suppose that 
b, b~! € X, and that ab, α΄ δ ε Σ;. Then there exists an element x of order 
6 such that b+ x=z. If x =zc, then b 1+ b°!zc=z, ie, b'+bce=z, 
and this is impossible because bc has order 3. Similarly, we find that 
x + bc~, a~'be, abe~ !. Hence either (1) b + ze~! = z, (2)b + ac” * =z, (3) 
b+b-'c=z, or (4)b+a ἷς =z. 

In case (1), £, = {b, b~1, zc, zc, δε" 1, a~ *c}, whence X, = {ab, α΄ *b, 
ac”, b~'c, a~ be, abc™ *}. There exists an element y € 2, such that y has 
order 6 and ab + y =z. If y=ac™', then 1+ b7'c™1 =b~'a~'z, which 
contradicts the fact that b~!c~! has order 3. Similarly, y # abc™'. If 
y=a‘be, then 1+2zc=b-‘a7-1z. But b+2c°0'=z>z+c 1=b=> 
zc+1=cb. This is a contradiction, since b~'a~'z+cb. Similarly, 
y # θ᾽ ᾿ς. Hence case (1) cannot occur. The same kind of calculations show 
that case (2) cannot occur, and that in case (3) the only possibility is 
ab + bc™! = z. The loop obtained in case (3) is not planar since, for exam- 
ple, (1+ b)-'(a+ b) -Ξ α΄ 'be = (1 + α΄ 'εῖ ἢ) '(α -- α΄ ἷε ἢ). Finally, in 
case (4), we find that the only possibility is ab + zc” ’ = z; this must yield the 
plane of class IVa2 over the irregular near-field of order 25 since this plane 
must appear as a solution. Hence there is no plane of class 13 with R* of type 
(iii). 

Suppose, finally, that R* is of type (iv). Write a* = z = —1. Then, in 
addition to 1 and z, R* contains two elements of order 3, two elements of 
order 6, four elements of order 12, and 12 elements of order 8. Let 
N = <a’, b); then N is of index 2 in R* and all elements of R* — N are of 
order 8. By the lemma, since | R* — N| = 12, the element 1 + 1 must belong 
to N, and there are three proper i-sets that contain elements of R* — N. 
Since these use altogether six elements of N, and since {b, b~'} < N is the 
only Z-set of type (ii), there exists a Z-set {1, 4, d~'} < N of type (iv), with 
1 + 1 = zd. It was shown earlier that d cannot be of order 3 or 6. Hence d has 
order 12 or 4. If d has order 12, there exists a X-set Σ that contains four 
elements of order 8 and the two elements of order 4. Moreover, there exist 
x, y€ =z such that x, y have order 8, x + y=z and χ' ἦγ has order 4. It 
follows that y = x! or x3. Since x + x71 = z only if x* = 1, we must have 
x+x>=z. Then 1+x?=x~!z=x>. Let 1+1=14, so that t = 2d has 
order 12. Then x? + x? = xt. By planarity, (x? + 1)~1(1 + 1) # (x? + x?) 
~1(1 + x7), ie, x°t # τ ‘x. But, since t has order 12, t = x, δι = b, x, where 
x, =x? or x® and b,=b or ὃ}. Thus x°t=2zxx,b, and 
t-'x = χερὶ ἰχ =xj'!xb, = zxx,b,. Hence 1 + 1 is not of order 12. 
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We can now assume that 1 + 1 is of order 4. In this case there exists a 
proper 2-set X, that contains four elements of order 8 and the two elements 
of order 6. Since Aut R* is transitive on the elements of order 8 and on the 
elements of order 3, we can assume without loss of generality that a, zb € x, 
and that a + ΖΡ = z. Then 2, = {a,a™', ΖΡ, zb~', a*b, a°b}. (Note that we do 
not specify at this stage whether 1 + 1 - αὖ or a® since this would distin- 
guish between aand α΄ ‘in (R, + ).) There are two other proper L-sets, Σ) and 
x3, each of which contains four elements of order 8 and two elements of 
order 12. Since neither 2, nor 2, can contain four elements from the same 
Sylow 2-group of R*, we can assume that αὖ, α΄, a*b~', a®°b-! € X,. Then 
either (1) a2b~', α΄ ΕΣ, or (2) ab-', a“ *b7*' EX. 

In case (1) we have either (i) αὖ + a°b~ * = zor (ii) α΄ + a®b~' = z (since 
a*b~', a®°b~! € X,). Furthermore, 2, = {ab~', a~'b™', ab, α΄ δ, ab, a®b}, 
where either (iii) ab! + a~'b =z or (iv) α΄ b+ + ab = z. Now (i) and (iv) 
are incompatible since (i) implies 1 +a’ =a°b™', while (iv) implies 
1+ α΄ = a®b~'. Similarly, (ii) and (iii) are incompatible since (ii) implies 
1 + αὐ = a7b~', while (iii) implies 1 + a~'b~* = a*b™ *. If (i) and (iii) hold, 
we find (a? + a~!)"1(1 + α΄ ἢ = a7! = (a? + ab) (1 + ab); if (ii) and (iv) 
hold, then (a? + a°)~1(1 + αὖ) = αὖδ = (a? + a7b)*(1 + ab). Thus the 
loops obtained from case (1) are nonplanar. 

In case (2) we find by considering Σ2 that either (v) αὐ + ab~* = zor (vi) 
a> τα. = z, and by considering ©, that either (vii) ab + a®b~' = z or 
(viii) α΄ δ + αὐ. = z. Here it turns out that (v) and (vii) are incompatible, 
as also are (vi) and (viii). If (v) and (viii) hold, then (a? + a*)x 
~1(1 + a3) = αὖ = (a? + αδ΄ 1) [( + a°b*), and no plane is obtained. So 
suppose that (vi) and (vii) hold. In order to determine the complete loop 
(R, +), we must define either 1+ 1 =a’ or 1+ 1 -  αδ. But if1+1=a’, 
then (a + α΄ 1) ἴ({ + a~!) = a*b = (a + ὃ) *(1 -- δ᾽ ἢ) Hence, to within 
isomorphism, there exists only one planar loop (R, +) that is compatible 
with R*, and this must yield the plane of class [Va2 over the regular near- 
field of order 25. This completes the proof that there is no plane of class [3 
and of order 25 that satisfies the hypotheses of Theorem 4. 


By Theorems 2 and 4, there are only 12 nonprime powers < 250 that are 
still “possible” orders for a plane that satisfies the conditions of Theorem 4; 
the three smallest of these are 85, 145, and 148. It is perhaps more significant 
that there is no plane of this kind with order 25. However, for n = p*, where 
p is a prime > 3, it may well make a difference whether p = 1 or 3 (mod 4) 
(since this determines whether the Sylow 2-groups of R* are cyclic or gener- 
alized quaternion for the regular near-field), and whether p = 1 or 2 (mod 3) 
(since the elements of order 3 clearly play an important role in limiting the 
possible X-sets). On both counts, and for other obvious reasons, the case 
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n = 49 is of particular interest ; however, it is probably too large to handle by 
the methods used here. In principle, the question of the existence of planes of 
class 13 with “small” order could be settled by computer (cf. Pankin [8]). The 
assumption that (R, +) has both inverse properties reduces the number of 
additive loops substantially. (Even n = 13 would be very laborious to decide 
by hand in the general case.) However, there is no reason to suppose that 
finite planes of class [3 must satisfy the assumption. 
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A Theorem on Cyclic Algebras 


HANS ZASSENHAUS 


THE OHIO STATE UNIVERSITY 
COLUMBUS, OHIO 


Introduction 


A cyclic algebra was defined by L. E. Dickson as a hypercomplex system 
H generated by two elements a, b over a field F such that: 


(1) the subalgebra of H generated by a alone is a cyclic extension E of 
finite degree n over F where the automorphism group of E over F is gen- 
erated by a certain automorphism o of order n; 

(2) bis invertible and 


b~ ‘ab = ao. (1) 


Olga Taussky studied in [2—4] the question of what one can say about the 
reduced norm of the commutator 


c=[a,6)=a4b—6a (ae F(a), be F(a)+ F(a)H) (2) 


in the case that n = 2 and H is a matrix algebra over F. By her results she 
established an interesting relation between the multiplicative commutator 
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equation 

(a,b) -- α΄ δ΄ ‘ab = α΄ (ac) (3a) 
and the little studied additive commutator equation motivating the present 
investigation. 


Let us mention right away that for arbitrary ὦ, a,, a, of F(a) and ele- 
ments a, + ba, of F(a) + bF(a) there holds the commutator equation 


[a, b] = ba,(ao — a) (3b) 


where the elements on the right-hand side of (3a) are characterized as the 
elements of E of norm 1 according to Hilbert Satz 90, whereas the elements 
ao — a occurring on the right of (3b) of the additive version are charac- 
terized as the trace zero elements of E. 


1. Simple Properties of Cyclic Algebras 


It follows from the definition of a cyclic algebra H that H is a centrally 
simple associative algebra with the n* basis elements aib* (0 <i <n, 
0<k <n), that 

O+b"=BeF (4) 


and that β is uniquely determined by H and the structure of E over F 
modulo the norm group N,,-(E\0), the multiplicative group formed by all 
nonzero elements of F that are norms of some element of E over F. 

The algebra H is a ring of matrices of degree n over F precisely if β is in 
the norm group. 

Indeed, if a satisfies the irreducible equation f(a) = 0 where 


fst —a,r'+---4+(-1), (EF, 1<i<n) (5) 


is some irreducible polynomial of degree n over F and if 


a’ 'o = Σ Bua? (B, EF, 1<isn, 1<k<n), (6) 
k=1 
then the companion matrix 
0 1 
0 1 
a= 

1 

(Ξ “ἢ Ὁ ay 


and the matrix b = (B;,) generate the full matrix algebra of degree n over F 
presented as a cyclic algebra such that β = 1, unique up to a norm factor. 
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In general there is the split model of H generated by the two matrices 


By 
ao 0 β1 
a= ao : b= 0 
“S Bn-1 
ao” β, 0.» 0 


with coefficients a, αἀσ, .... ασπ 1, B,,..., B, in some splitting extension ® of f 
over F such that 


n 


f(t) = [](@- ασ᾽ ἢ (7a) 


jr1 


B=BiB. °° B> (7b) 


which gives a concrete model of the cyclic algebra H over F. 

As is well known, any finite-dimensional centrally simple associative 
algebra H over the field F contains maximal commutative subalgebras that 
are finite separable extensions of F. In particular, if H is of dimension 4 over 
F, then H contains a separable quadratic extension E of F. This means that 
in the latter case E is a cyclic extension with involutory generating automor- 
phism o over F such that there is some invertible element b of H so that for 
any generator a of E over F we have (1) and H is cyclic. 

On the other hand, it has been shown recently that not every centrally 
simple hypercomplex system is a cyclic algebra. 

We observe that 


H = + b‘E (8a) 
k=0 
and according to (1) 
ab‘ — b‘ac* = 0 (8b) 
so that 
[δ", a] = b*a — ab‘ = b*(a — ao*) (8c) 
[E+b‘E]=b' Και τ (keZ, ged(k,n)=1) (8d) 
where tr is the F-linear trace mapping 
EF: tr 


n—-1 
xtr= δ᾽ χα (χε ΕἸ 


j=0 


of E on F. Note that the well-known additive version of Hilbert Satz 90 just 
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tells us that 
E(1 — o) = ker tr. (86) 


In the event that the degree n οἵ E over F is not divisible by the character- 
istic of F a considerable simplification of the problem is achieved by adjoin- 
ing to the field of reference F a primitive nth root of unity ζ such that over 
the new field of reference F, = F(¢) there arises the generalized cyclic 
algebra H, = F, ὦ, H = F, H containing the separable normal commuta- 
tive algebra E, = F,; @; E = F,E = F,(a) of dimension n over F, with 
generating automorphism 


E,-E;: σι 


(α Θη)σι -α Ὁ (no) (ΔΕ ,, ηε ΕἸ 
Ε Ε 
and with standard generators a, b. 

The simplification is achieved by Lagrange’s construction according to 
which 


E, = F,(a,),0# αἴ =ae F,, |a/(F,\0)"| =n, (9) 
a,o, = la, (10) 
and Eq. (9) assumes the form 
b-'a,b= Ca, (11) 
or, without inversion, 
a,b — Cha, = 0. (12) 


The latter form of condition (11) has the virtue that for given a, the equation 
a,x — (xa, =0 (13) 


defines the n-dimensional linear solution space E, b = bE;. 
Any one of its regular elements together with any generator of E, over F 
presents H, as a cyclic algebra. 


2. A Theorem on Cyclic Algebras 


Generalizing a recent theorem of O. Taussky for n = 2, we have 
Theorem 1 (a) For given a of E the norms of the Lie products 
[x, a] (x € E + bE) 
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over F form the set 
(@)BN(F) 


where ¢ is a primitive nth root of unity. 
(b) The norms of the Lie products 


[Ρ, ΕἸ 
(b a fixed regular element of E + bE) over F run through all of 
CBN (F). 


(c) For any order A of H over the integral domain A τὰ F with quotient 
field F and for given ἃ of A τὰ E the norms of the Lie products 


[χ, ἢ] (xe (E+bE) nA) 
over F form the set 

COB (a — ao){N(x)|x ε 90 
when Q is the A τὰ E-ideal 

W={ylye(((b E+ E) 0 87 .1ΔΛ) τ Ἦ δ᾽ ᾽Ε) τὶ E}}. 
Proof Note that the minimal equation for ὃ over F is 
b"— Bp=0 
so that 
N(b) = Cp. 

The remainder follows from (8b)}-(8d). 


The striking relationship between any two quadratic subfields of the ring 
of 2 x 2 matrices over the field F established by O. Taussky’s theorem 
stating that the determinant of the commutator of any pair of generators of 
the two subfields is equal to a negative norm into F from each of the two 
extensions is generalized by the theorem only to pairs of subfields of the ring 
of n x n matrices over the field F such that one of them is a radical extension 
generated by an element transforming the other extension according to an 
automorphism of order n. 

In order to test the general relationship between the cyclic extension F(a) 
and the norm of the commutator [4, 6] where 


6 = Sa ‘a, = P,(a), ἃ = P(a); P,(t) ε Flt], P(t) e Flt]; l1<k<n 
k=0 
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we use the split model with βι = B, =-::'= 8B, =y 
ao 0 1 
aa? 0 1 
a= 5 b = γ 
1 
ac” 1 0 0 
where we assume y to be a solution of the equation 
y= β (14a) 


It follows that 


[4, 6] = > ((P(ao') — P(ao*))P i 19,(ao')y?") 
i,k 
i—k if i-—k>0 
((-i- = i-kt+n if i-k<0O, 


I<isnisk<n| 


so that the norm of the commutator of ὦ, b appears as the determinant of the 
matrix on the right, which turns out to be a homogeneous form of degree n 
in both the coefficients of P as well as (jointly) in the coefficients of the P, . In 
the cubic case there appears the simple result 


N([4, ao + ba, + b*a,]) = N(ao — 4)(BN(a,) — β΄ Ν(α3)) 
(ao , dy, 42 € E, n = 3). (14b) 


In general there appears a norm form with many terms. 
A deeper question concerns the Lie algebra characteristics of the direct 
decomposition (8a). This question will be discussed in Section 3. 


3. The Spectral Decomposition Belonging to Maximal Commutative 
Subalgebras 


It is well known that a ring H always is a bimodule relative to any 
subring E of H. 

Let us follow up this remark by a more detailed analysis in case H isa 
unital hypercomplex system over a field F and E is a commutative F- 
subalgebra containing 1,,. 

In that case we interpret H as a representation space of the tensor 
algebra E'?) = E @, E, a unital commutative hypercomplex system over F, 
according to the operational rule 


ΕΠ) + End, H: A 
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h((a, ® a,)A) = a, ha, (he H; ay, a, €E). (15a) 


Let e,, e2, ..., e, be the primitive idempotents of E. It follows that the 
idempotents e; x e, of E?) form a system of orthogonal idempotents of E‘?) 
adding up to 1,2, = 1 x 1. Accordingly, we obtain the decomposition 


i=1 k=1 


of H into the direct sum of the E-bimodules e; He, which are characterized as 
the set of all elements of H with e; as left identity and e, as right identity. 
The multiplicative interaction of the components is given by the rule 


(e;He,)(e, He,,) S δι. ,e; He, (ἰ, Κ, 1, 21,252, 1), (450) 
which implies the Lie interaction rule 
[e; He, , e;, He] Ξ b,, ,e; He, + dy, 6, He, . (15d) 


We remark that the subalgebras e;He; of H are certainly simple F- 
subalgebras in the event that H is centrally simple over F. 
In order to make further progress assume now that: 


(1) Η is centrally simple over F, of degree n; 

(2) E isa subfield of H that is a maximal commutative subring, in other 
words E is an extension of F1,; 

(3) Eis separable over F. 


It follows that E® is semisimple. Hence there are finitely many primitive 
idempotents d,, ..., d, of E that are orthogonal adding up to 1. E® is the 
direct sum of the minimal ideals Ed, (1 < k < 4). Correspondingly, we find 
the decomposition | 


4 
k=1 


of H into the direct sum of the E-bimodules Hd, which are characterized as 
the set of all elements of H with d, as right identity. Let us number the d, in 
such a way that the kernel of the diagonal F-epimorphism 


E® + E: ὃ 
(a, x a,)d = a, a, (a,, a, € E) (15f) 
coincides with the maximal ideal] 


oo 
Ed, . 


k=2 
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It follows that ὃ restricts on Ed, to an F-isomorphism onto the E- 
bimodule containing E. 

As a matter of fact Hd, consists of all elements of H centralizing E so that 
by assumption 


Hd, = E= Ed, 6. 

The E-bimodule H is operator isomorphic to the E-bimodule E™’. 
Indeed, there is a regular element x of H for which x" 'Ex τ E = F. Hence 
H =x" 'ExE = ExE 

and the mapping 
E® + H:1 
(a, x α2}1 = axa (a,, a, € ΕἸ 


provides an E'?)-operator isomorphism. 

In order to discuss the multiplicative interaction of the components let us 
begin with the case that E is a normal extension, which is tantamount to 
q=n or, equally well, to the statement that each component Hd, 15 n- 
dimensional over F. Denoting by G the automorphism group of E over F we 
may number the primitive idempotents of E™) by the elements of G, say d(c) 
(σ € G) in such a way that 


ad(s)=d(c)(ac) (ae E) (15g) 
Ed(c) = d(a)E = E™ d(o) (15h) 
ad(a)i=d(c)i(ac) (ae E) (15i) 
Ed(c)i = d(c)1E = Hd(o), (151) 


and the elements y of the component Hd(c) of the direct decomposition 
(15c) are characterized by the relation 


ay=y(as) (ae E). (15k) 
Since for o, τ of G we have 
ad(c)id(t)1 = ἀ(σγι(ασ)ά(τ)ι = ἀ(σ)ια(τ)ι(ασ)τ, 
it follows that 


d(c)id(t)i = d(at)ic(o, τ) (151) 
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where 
c(p, a)tc(pa, τὴ = c(p, στ)ε(σ, τὶ (15m) 
d(1g)i = 1y, c(1¢, 19) = 1ε = Ἰῃ (15n) 
Hd(c)oHd(t) = Hd(or) (150) 
[Hd(c), Hd(t)] < Hd(ot) + Hd(rc) (15p) 
[y,a]=y(a—ac) (yeHd(c), σεσ, ae E). (15q) 
The last two results show that the decomposition 
LH = Σ Ηακ(σ) (15r) 
σεσ 


of the E-Lie algebra LH associated with H into the direct sum of linear 
subspaces over F is the spectral decomposition of LH with respect to the 
abelian subalgebra 

LE = Hd(1,) (15s) 
that is associated with E. 

In the general case we embed E into a minimal splitting extension of E 
over F and we embed ® as maximal abelian F-subalgebra into the ring of 
matrices H, of degree [®: Εὖ over H. Hence E=@® ἡ H. 

Let G be the group of automorphisms of ® over E, let S = Aut(®/E) 
and let 

0 - ΦΘΩΦ -- Σ᾽ Oda) 
Ε 


σεσ 


where the d(c) are primitive idempotents of ©) such that 
αὐ(σ) τ-Ξ- ἀ(ασ (ceEG, αε Φ) 


and 1 is ἃ ®)-isomorphism of the regular ®-bimodule Φί2) on H, mapping 
®'7)d(1g) on ®. Hence there is the direct decomposition 


H, = Σ Ἢ «(6) (68) 


σεσ 


of the ®-bimodule H, with components 
Η, ἀ(σ) = ἰνίν ΕΗ, & Vala ε D=ay = y(ac))}. (16b) 


The tensor algebra 
4 


EX =EQE= >) Ed, (16c) 
F k=1 
with primitive idempotents d,, ..., d, is a subalgebra of ®'?) such that 


d, = d(1). The question is, which way is the direct decomposition of the 
E-bimodule H = Σκ3, Hd, related to the Remak decomposition (16a)? 
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Any idempotent of ©) is of the form 


= Yd) (164) 


σεᾷ 


where δὰ is a nonempty subset (complex) οἵ G. The question is, which 
complexes are involved in the presentation of idempotents d,? 

Choose an F-basis b,, ..., b, of E. Any element of E®) is of the form 
x = )7_, b; x x; with x; uniquely determined by x in E and 


x=xl=xY d(s)= a x x,)d(o 


σεσ 


=) Σ b;d(o)x; -y Σ Σ d(S0,,.64;)(b; OOK χ; 


i=1 σεσ i=1 k=1 j=1 


where G =| )ζ- δσκ δ, So, S =|_)#, So, o,; with o,; in S and δ᾽ αι =n. If 
x = d(S), then 


1 if So,a,;5R 


Ἔχ; = 1 
2, (bio) 0 if δσ, 5 ¢ R. ( 6e) 
On the other hand, the system of linear homogeneous equations 
Σ (BiG: Oe p)Xi = Very (l<j<q@, 1<k<q) 
i=1 
(1 «- Κ' <y, 1 -κ' <q) 
for the unknowns x,,..., Xx, 


has precisely one solution in ® because of the separability of E over F. But 
upon application of an element o of S to the system the n left-hand sides only 
are permuted though the x; are changed to x;o. Hence, if x; € E, then the 
permutations connected with the elements o of S leave the right-hand sides 
invariant. 

This means that δὲ is a union of several double cosets Sa, S. On the other 
hand, setting y,,;,= 6,,, we find that x,;o = x; for all o of S so that x; belongs 
to E and ἀ(δσ, 5) belongs to Ε(2. 

It follows that the primitive idempotents of E are precisely the idempo- 
tents d(So, 5) corresponding to the double cosets. Hence we have 


after suitable numbering of d,, ..., d,. Thus the multiplication rules 
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and the Lie multiplication rules 
[Hd, , Hd,,] S Hd(So,So,S) + Hd(So,,So,S) 
(l<k<q, 1<k’'<q)  (18b) 


emerge. It should be noted that for any set of double cosets 
{So,,5, eee So,,,5 | 1 < k, < k, << Ks Ss 4) 


we have the rules 
m 


a{ USo.5] = ΤΣ (ὅσ, δ) (19) 


i=1 


Hal Use| 
i=l = 


which tell us all worthwhile knowing about the evaluation of the right-hand 
side of (18a), (18b). 


Hd,, (20) 
i=] 


i= 


4. Crossed Product Rings 


For any unital ring R there is the canonical homorphism 
U(R) > Aut(R): 0 
a(x0)=x™*ax (aeER, χε U(R)) (21) 


of the unit group U(R) of R into the automorphism group, Aut(R), of R. 
Its image is a normal subgroup Inn(R) of Aut(R), the inner automorphism 
group of R, which is normal in Aut(R) according to the transformation rule 


α΄ *(xO)x = (0 (xe U(R), we Aut(R)). (22) 
The factor group is the outer automorphism group of R 
Out(R) = Aut(R)/Inn(R). 
Supposing R is contained in a unital overring A such that 
ee (23a) 


The normalizer of R relative to the unit group U(A) of A is defined as 
the subgroup 


Nua(R) = {x|x € U(A) ἃ x7 'Rx = R} (23b) 


of U(A) containing U(R) as normal subgroup. 
For any subgroup H of Nua)(R) we have the relations 


hR=Rh  (heH) (23c) 
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so that R and H generate the subring 

HR = <R, HD = RH (23d) 
of A. We set ourselves the task of studying the structure of HR in terms 
of H and R. 
Analysis of the Problem 


Without loss of generality it may be assumed that 


U(R) © H « Nya)(R), (24a) 
HR = A = RH. (24b) 
We observe that the kernel of the homomorphism 
H = Aut(R): φ 
a(hg) = h~ ‘ah (aeR, με ΗῚ (24c) 


is the centralizer of R in U(A): 
Cya(R) = {x|x € U(A) & Va(a € R = xa = ax)} 


and that the kernel of the corresponding homomorphism of H into the 
outer automorphism group of R 


H > Out(R): @ 
ho =ho/Inn(R) (he FH) (24d) 


is the product of the two normal subgroups U(R), (Cua)(R)) 0 H. The 
ring R, = <R, Cya)(R)> generated by R and Cya)(R) is the product of the 
two elementwise commuting subrings R and Κρ = (Cy,a)(R)> intersecting 
in a subring of the center 3R of R. There hold the relations 


hR, ΞΞ Κι ἢ 
for any element h of H so that 


CR, HY = HR, =R,H = Y¥ hR,. 


heH 
Without loss of generality we can also assume that 
Cua(R)S 3R, ker @ = U(R). (24e) 
Hence we have the exact sequences 
1>U(R) —> H —> S-1, 
15U(3R) “lk. H --. Aut(R) (24f) 
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and the conditions 
ΚΒ = {alae R & Vh|he -Ξ- α -- α(μφ)ὶ Ξ 323[8), ππφ-θ (24g) 


when 1, is the natural embedding monomorphism of U(R) in H, ες is the 
natural epimorphism of H on the factor group S = H/U(R), and @ is a 
monomorphism of H in Aut(R). 

Thus H emerges as a special kind of group theoretic extension of the 
unit group of U(R) characterized by (21), (24a)-(24c), (24e), and (24f) which 
we will call an R-admissable extension of U(R). 

Our task is to describe the rings A satisfying (24a) and (24b) in terms 
of the R-admissable extension H of U(R). 


Definition The crossed product of a unital ring R and R-admissable 
extension H of the unit group U(R) of R is defined as the ring R x H 
with the generators ἄ(α € R), h(h € H) and the defining relators 


ὦ, +4,=a,;+ 42, aa, ΞΞ 16) (a;, a2 € R), (25a) 
hyhz=hyh, — (hy, hp € Η), (25b 
XIy =X (xe U(R)), (250) 
h-*ah = a(ho (aeR, heH). (25d) 


Theorem 2 (a) The mapping 
R-Rx H:1, 
al,=a (ae R) (25e) 
is a monomorphism of the ring R into the crossed product of αὶ and H 
mapping the unit element of R on the unit element of R x H. 
(b) The mapping 
H - U(R x Ht): 13 
hiz=h (he) (25f) 


is a monomorphism of H into the unit group of the crossed product of R and H 
such that 


tats = U(R)| tg (258) 

and 
H1,Ri,=R xX H = Rl, Hi, (25h) 
Nurx m(R12)Ri, =K x AH = Ri, Nurx (R12), (25i) 


Cur x (Κι) = Κι. (251) 
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(c) Identifying R with Ri, via the embedding monomorphism 1, and H 
with H1, via the embedding monomorphism 13 the crossed product of R and H 
appears as the universal solution of our task of which all other solutions are 
obtained by the application of an R-epimorphism. 


Proof Let T a transversal of H modulo U(R):, such that 
H = T(U(R)i,) = (U(R)i1)T 
T © T(x1,) is empty whenever 1 +4 χε U(R), 
1, € T, (26a) 
{τ΄ =t(c(t’,t’)t,) ((, τ΄, te T; c(t’, 15) ε U(R)). (26b) 


The set (ΚΕ x H)* of all formal sums δ᾽, εὐ t(ta) (a any mapping of Tin R 
with only a finite number of elements of T with nonzero image) with the 
operational rules 


δ᾽ t(ta) = Σ᾽ t(th)<a=b, (26c) 
δ t(ta) + ΟΣ t(th) = Σὲ t(t(a + b)), (26d) 


| »y (a) Σ t(tb) = > t(t(a x b)), (26e) 


teT re T te T 


where the mapping a x b of Tin R is defined by setting 


ax b)= ΟΣ οἱ, t")((ealt”o)\(r"b)), 
ve rs UR) 
is verified to be a ring by an easy computation. The mapping of a on lya 
(ae U(R)) and of t(x1,) on tx (te T, χε U(R)) preserves the defining 
relators of R x H as another easy computation shows. Moreover, the 
elements 1,,a, tx of above generate (R x H)*. Hence the mapping of above 
extends uniquely to an epimorphism ¢ of R x H on (R x H)*. 

On the other hand for any solution A of our task the mapping of the 
formal sum δ, ετ ¢(ta) on the corresponding sums defined in A preserves 
the operational rules (26d) and (26e) as a third computation will show. 
Thus it is shown that ¢ is an isomorphism and that (26c) holds. 

The mapping of a on 1,4 (ae R) turns out to be a monomorphism 
of R in (R x H)*. Thus (a) is demonstrated. Note that μα is the unit 
element of (R x H)*. The mapping of t(xz,) on tx ({ε T, χε U(R)) turns 
out to be a monomorphism of H into Νυκκχ βγη Ἢ R) mapping x1, on 1yx 
(x € U(R)). Thus (25b), (25h), and (251) are demonstrated. 

Finally let us consider those elements z = [μα (a € R) which commute 
with all elements tx (t € Τὶ x € U(R)). Each element ἢ of H is of the form 
h = t(x1,) for some t of T, x of U(R) and conversely; the commutativity 


A THEOREM ON CYCLIC ALGEBRAS 377 


relation z(tx) = (tx)z is tantamount to Ζ(!φ) = z. Hence according to (24f) 
we have z € 3(R), demonstrating the remainder of Theorem 2. 


Extension Invariance 


One of the criteria applied to any algebraic construction is the question 
how well it behaves in regard to an extension of the basic domain of 
rationality. 

Suppose the given unital ring R is embedded by means of an embedding 
monomorphism 

R-Q: ip (27a) 


into a unital ring Q such that the unit element of R is mapped by tp on the 
unit element of Q. 

Is it possible to embed a crossed product of R with an R-admissible 
extension H of U(R) into a crossed product of Q with some Q-admissible 
extension Hg of U(Q) such that the embedding monomorphism 


Rx H>Q x Ho: trxag | (27b) 


restricts to tg on R but to a monomorphism into Hg on H and that the 
ring Q x Ho is the ring generated by the subrings (R x H)iz,x 47, Q? Is it 
true that Hy = Η͂,,., U(Q)? 
We observe that necessarily the homomorphism 
Hy Aut(Q): @o (27c) 


inherent in the construction of Q x Hg restricts to a homomorphism 
H Qo for which 


(aig)(hpg) = a(hp) (ae R, he Η), (27d) 


a condition which restricts our choice of Q. 
We consider only the case that Q is the tensor product of the ring R 
and a unital overring S of the central subring R” of R over ΚΗ: 


IRXH 


Q=RO@S (27e) 
RH 
when it is required that 
im = Lu 


and that the canonical embedding homomorphism (27a) mapping aon a ® 1; 
(α ε R) is a monomorphism. Without loss of generality the canonical 
homomorphism 


5-»Ω: Is 
Sis = 1p @S (5 ε δ) 


can be assumed to be monomorphic. 
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Now there is the canonical homomorphism (27c) given by setting 


(a @s)(h@q) = (alhe)) x 5 (27f) 

satisfying (27d). 
The condition (270) requires Hy to be a homomorphic image of the 
amalgamated semidirect product of H and the unit group of Q over U(R), 


say 
U(R) 


HO U(Q). 


φῶ 
Here quite generally we define the amalgamated semidirect product 
N 
ΑΓΒ 
Ψ 


of two groups A, B over a common normal subgroup N as follows. 

We assume N to be a normal subgroup of A with given monomorphism 
ιν into B such that Niy is a normal subgroup of B. Furthermore there is 
a homomorphism of A into the automorphism group of B given satisfying 
the coherence condition 


n0,¥%=npO, (πε Ν) (276) 
when 0,, 0, denote the canonial epimorphism of the elements of A, Β, 
respectively, on the corresponding inner automorphism groups. Let 


N 
AD B={aOiblaeA, be B! (27h) 
Ψ 


with multiplication rule 
(α ΓῚ ῥ)(α’ Ob’) = (aa’) 0 (ῥ(α Pb’) (271) 
and equality rule 
(α, α' ε 4: »,0' ε Β) aQb=a [] δ΄ «-α΄΄ἷα ΕΝ & a law 
= δ᾽. 
One verifies easily that A OB is a group with canonical embedding 
monomorphisms 


N N 
A>-AQB:1,, BoALDLB: 1, 
Ψ Ψ 


ἀϊχ Ξε ἃ Γ] 1. θιβμτξε 1,06 
satisfying the coherence condition 
Nli,= ln lp, (27k) 
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moreover 
N 
Νι, ΑΓΒ, Byp<AOB, (271) 
Ψ 
Ν 
(A14)(Big) =A C] B, Al, ΓᾺ Big = Ni,.- (27m) 
Ψ 


A particular example are the semidirect products of A, B arising whenever 
N = 1. We denote them simply as A ΓΠ ψ B. 

Continuing with the construction of an extension of the given crossed 
product R x H by means of the tensor product (27e) we let 


U(R) 
Ho =H OO U(Q), (27n) 
φῶ 
Hy > Aut(Q): o% 
(h [Π €)94 = πφρίεθᾳ) (270) 


(he H, εε U(Q), θῳ the canonical epimorphism of U(Q) on Inn(Q)). It 
follows that Hg is an Q-admissible extension of U(Q) such that the crossed 
product 2 x Hog is an extension of R x H in the sense defined in the 
beginning of this section. Let us observe the isomorphy relation 


Q x Hp =~ (R x H)@S (27p) 
RH 


with canonical isomorphism mapping the element (a@s)(hCle) on 
(ah @s)e (ae R, 5 Ε δ, he H, ee U(Q)). 


Examples 


(a) The group ring R[G] of a group G over a unital ring R is a crossed 
product of the subring R[N] and G for any normal subgroup N of G that 
contains its centralizer in G according to 


R[G] = R[N] x GU(R[N]). (28a) 


(Ὁ) Suppose R is a unital ring with primitive idempotents E,, ..., E, of 
the center of R such that E, +---+ E,=1, and H is an R-admissible 
extension of U(R), then the automorphism group H@ permutes the finite 
set S = {E,,..., E,} such that S is partitioned into a finite number S,,..., 5 
of Hg-orbits. It follows that the idempotents 

E, = Σ Εἰ ([ «) Ξ σὴ 


E,eS; 


a 


are invariant under the automorphisms of R belonging to H@ and that 
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the idempotents Es ,..., Es_ are orthogonal idempotents of the center of the 
crossed product R x H with the unit element as their sums so that there 
is the split 


Rx H= ΣΈ, (Κ x H) 
j=1 


of R x H into the direct sums of σ ideals of R x H. 

For each of those ideals let us form the centralizer C,(Es,R) of the 
subring Ες R of Es(R x H) relative to H and a transversal T; of C,(Es, R) 
modulo the normal subgroup C,(Es,R) αὶ U(R). It follows that the ring 
Es(R x H) contains the subring 

te Tj 
invariant under Hg and that Ες (Κ x H) is a crossed product of R; and 
ἔς Η so that 


R x H Sond OF=1 R; x Es H. (28b) 


In this way the study of the crossed products for a ring R that is 
isomorphic to the algebraic sum of a finite number of algebraically 
indecomposable rings is reduced to the case that those indecomposable 
components are transitively permuted by the group of automorphisms Hg. 

(c) Let us study the crossed product of a semisimple ring R with an 
R-admissible extension H of U(R) such that the primitive idempotents 
E,,..., E, of the center of R are transitively permuted by the automorphism 
group Hq. According to the second structure theorem of McLagan- 
Wedderburn there is the decomposition 


R= )°R,; when R;=E;R, (29a) 
j=l 


of R into the discret sum of o simple ideals. By construction the simple 
components of R are isomorphic to one another. 

Let us analyze this situation! 

For any simple ring R, there is the restriction homomorphism 


Aut R, > Aut(3R,): μὴ 
Op, = 4Κ||α (a € Aut(R,)) 
of the automorphism group of R, into the automorphism group of 3R,. 


Each automorphism « of R permutes the primitive idempotents of the 
center of R so that there is the permutation representation 


Aut R-> p,: A 
Ejaa = Ej (a € Aut R). (29b) 


J 
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In fact 
(Aut R)A = p, (29c) 


and Aut R is the semidirect product of the kernel of A and the subgroup 
p, formed by the automorphism 


R-R:t& 


(ra θὴπ- 24:9 (29d) 


(x € p,, a; € Ry, 0; an isomorphism of R, on R; (1 <i <a), 6; = 1p,). 

The kernel of A consists of the automorphisms of R that leave every 
minimal ideal invariant. It is the direct product of the subgroups Aut(R;) 
formed by the automorphisms of R fixing the complementary ideal of R; 
in Καὶ element wise (1 <i <0). 

There is the canonical monomorphism 


Aut R; > ker Δὲν; 


[ δ᾽ α)θ,)αν, = a; 0; 0 + >, α|θ;, 
)ΞῚ1Ὶ j=l 


πὶ 
(α; ε Κι! -) 3 σ), «e Aut R;) (29e) 
sending Aut R; on Aut R; and, of course, there are the isomorphisms 
Aut R; = 6; ‘(Aut R,)0; (29f) 


of Aut R; and Aut R, for i = 1, 2,..., σ. 

In this way the automorphism group of R is seen to be isomorphic 
to the wreath product of p, over the automorphism group of R,, the 
outer automorphism group of R is isomorphic to the wreath product of 
p, over the subgroup Aut(R,)u, of Aut(3R,): 


Aut R = Aut R,Sp,, (29g) 
Out R ~ Out R,§p,, (29h) 
Aut R, =~ Aut R, (1 <j <o), (291) 
Out R - (Aut Ry). (29}) 


The factor group of an R-admissible extension H of U(R) over U(R) 
is isomorphic to a subgroup S of Aut R, in fact there is the exact sequence 


1+U(R) 4 H -% Sot. (29k) 


By assumption the permutation group HA of σ letters is transitive. 
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Conversely, if S is a subgroup of SA such that Ηρ μ is transitive 
and if there is the exact sequence (29k) for H find the crossed product 
R x H with the property that 1, is the only idempotent of R invariant 
under Hog. 

When is the crossed product simple? In view of the conjugacy of the 
orthogonal idempotents E,; (1 <j <o) under H the simplicity of R x H 
implies that the subalgebra E,(R x H)E, is simple. 

If the element ἢ of H does not belong to the H-centralizer H, of Εἰ. 
then there holds an equation 


with j 1 so that E,hE, = 0. Hence 
E,(R x H)E, — H,R, = R, Ai. (30) 


Furthermore the elements of H, commuting elementwise with R, form a 
normal subgroup Ηΐ of H, such that 


E, Ht E, ~ H,/Aut R,, 
U(3R,) A E, HE, 
there is a transversal T, of E,H*E, over U(3R,) such that 
E, HtE, = T,U(3R1), Τὶ 0 U(3R1) = ῈΕ,, 
xT, 0 UBR,)=® (1 # χε U(3R,)), 
U(3R, At = U(3R,)T, 


T, is linearly independent over 3R, and U(3R,)T, is the centralizer of 
R, in Κι... 

Fhe simplicity of R,H, implies simplicity of the centralizer of the simple 
subalgebra Κι. As can easily be shown, Τὶ is finite. As the example 
provided by any quaternion algebra over a field of characteristic #2 shows 
it can happen that 7, consists of more than one element. 

Conversely, if Cr, #,R; is simple, then it follows from E. Artin’s argument 
on nonvanishing sums of the form 

δὲ h(ha) 
he X; 
(X, a transversal of H, modulo the normal subgroup N, of all elements 
n of H, for which ng belongs to Inn(R); a is a mapping of X, in R, with 
all but a finite number of images equal to zero) which are contained in a 
given ideal of E,(R x H)E, that the subring E,(R x H)E, is simple. 

Now it follows from E. Artin’s argument applied once again in another 

context that R x H is simple. 
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Theorem 3 (a) Any semisimple ring R is of the form 
σ 
j=l 
where E,, ..., E, are the primitive idempotent elements of the center of R and 
the numbering is chosen in such a way that there are isomorphisms 


E,,R— E;R: 0; 
(,-1<j<o,, 0,,=12,R, 1sk<t, t>0O, 
0 = do < 6, <0, «::«σςΞ σὴ 


but that the simple algebras E,, R are nonisomorphic for k = 1, 2, ..., τ. 
(b) There is the permutation representation 


Aut(R)— p,: A 
ΕΞ μα (ae Aut(R), 1<j<o) 


with t orbits δὲ = {o,-1 + 1, Lewy σκὶ (1 < k < τ). 
(c) The automorphism group of R is the direct product of the subgroups 
(Aut R), formed by the automorphisms of R which leave the idempotent 


as well as every element of the complementary ideal (1, — Es,)R of Es, R fixed 
(1 <k<v7). 

(d) The direct component (Aut R), is isomorphic to the wreath product 
of Aut(R,,) and Ῥισι-- σι. (1 <k <7). 

The outer automorphism group of Es, R is isomorphic to the wreath product 
of (3R,,)|Aut(R,,) and Ῥισι-- σκ-- 1) ( «ΚΞ τ). 

(e) The crossed product of R and an R-admissible extension H of U(R) 
is simple precisely, if t = 1, H@A is transitive and the centralizer of the 
projective group algebra 3(E,R) ® wye,r)) Cu E1R is simple. 

It is a hypercomplex system over some field precisely if (e) is satisfied 
and if 3(R) is of finite dimension over the field Cp(H), i.e, Cy(E,R) is of 
finite index in Cy(E,). 

It is of the O. Teichmiiller type precisely if (in addition) R is 
commutative and Ομ(Ε 4) = Cy(E,R). 

It is of the classical A. Albert-R. Brauer-H. Hasse-E. Noether type 
if (in addition) R is a field. 

It is of the Dickson-Wedderburn type if (in addition) R is a finite cyclic 
extension of ΕἾ. 
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Corollary The automorphism group of a ring of matrices R = R£*S over 
the unital commutative ring Ro is the semidirect product of the centralizer of the 
matrix units in Aut(R) and the inner automorphism group. The first factor is 
isomorphic to the automorphism group of Ro with the canonical isomorphism 
which maps the matrix (a;,) on the matrix (a;,«) for « in Aut(Ro). 


5. Crossed Product Orders 


Let E be a semisimple ring and let H be an E-admissible extension of 
U(E) such that the crossed product A = E x H is simple of finite dimension 
over the center F = E”, 

In this section we study the arithmetics of certain orders of A over 
dedekind rings R of F with F as quotient field which are related to the 
presentation of A as a crossed product. 


Definition The R-order A of A is said to be a crossed product order 
relative to the given presentation of A as crossed product over E if there 
holds the decomposition 

Η : U(F) 


A= 2 “A ὦ (HE) (31a) 


i= 


of A into the direct sum of the intersections of A with the crossed product 
components of A, where 
H:U(F) 


H= |) Η,, Hy, =U(E) (31b) 


i=1 
is the coset decomposition of H modulo U(E) and where 


H:U(F) 
A= Σ ΗΕ, (310) 
i=1 


(AO H,E(AQH,F)=A0H,H;E  (i,j=1,2,...,H:U(E)). 
(31d) 


We observe that because of (31a) a crossed product order A satisfies the 
invariance condition 


Ny(A © U(E))U(E) = Η. (316) 


Theorem 4 (Benz-—Zassenhaus) Let A be absolutely semisimple over F. 
Every crossed product order A is embedded into a crossed product order ΛῈΣ 
intersecting with E in a hereditary order such that the embedding is invariant 
under all F-automorphisms of A. 
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The intersection of all maximal orders of A containing A** is a hereditary 
order A* of A. 


Proof “Hereditary orders” emerged in the theory of orders as an appro- 
priate generalization of “maximal orders” first conceived by Auslander and 
Goldman and independently by H. Benz about twenty years ago. Of the 
many equivalent definitions of hereditary orders of a semisimple hypercom- 
plex system B over F let us mention the following definitions. 


(H1) The order of B is said to be hereditary if any A-sublattice M, ofa 
A-lattice M has a complement in M precisely if the A-factor module M/M, 
is a A-lattice (Auslander-Goldman). 

(H2) The order A of B is hereditary precisely if for every prime ideal ") 
of R the )-radical of A, ie., the ideal Jy(A) of A containing YA with the 
property that Jy(A)/A is the maximal nilpotent ideal of A/%)A, is invertible 
relative to A (Auslander-Goldman). 

(H3) The order A of Bis hereditary precisely if for any prime ideal 9) of 
A the left order of Ja(A), ie., the R-dedekind module 


[Jy(A)/Jg(A)] = {ala € A € aJy(A) Ξ Jy(A)}, 


coincides with A (Zassenhaus). 
(H4) The order A of B is hereditary precisely if for any prime ideal 4) of 
A the right order of Jy(A), ie., the R-dedekind module 


[Jo(A)y(A)] = (816 € A € Jg(A)b Ξ Ja (A); 
coincides with A (Zassenhaus). 


(H5) The order A of B is hereditary precisely if for any prime ideal 3) of 
A its A-idealizer, i.e., the R-dedekind module 
[Jg(A)/Ja(A)] 0 [Jy(A)\V(A)], 
coincides with A (Zassenhaus). 
(H6) The order A of B is hereditary precisely if for any prime ideal 9) of 
R the )-localization of A, i.e., the (R/(R — )))-order 
A 


ee 1:1 ς 
R-¥ {Atalae ANA &AER λέ ἢ) 


is hereditary over the Y)-localization 


R 


ey ae R,A¢ Y}, 
R-Y fAir|jreR&iAe ¢ Y} 


of Καὶ (Auslander—-Goldman). 
(H7) The order A of Bis hereditary precisely if for any prime ideal 2) of R 
the 9)-adic completion of A, i.e., the Ry-order Ag of all )-adic limits of 
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Y)-adic convergent sequences of elements of A, is a hereditary order over the 
Y)-adic completion ring Ry of R (Auslander-Goldman). 

(H8) The order A of B over R is hereditary precisely if the intersections 
of A with the simple components of B are hereditary over R and if A is the 
direct sum of its intersections with the simple components of B 
(Auslander-Goldman). 

(H9) The order A of a commutative semisimple hypercomplex system 
B over the quotient field F of a dedekind ring R is hereditary precisely if it is 
the maximal order A(B, R) of B over R (Auslander-Goldman). 

(H10) The order A of a division ring B of finite dimension over the 
quotient field F of a complete local dedekind ring R is hereditary precisely if 
it is the maximal order A(B, R) of B over R (Auslander-Goldman). 

(H11) The order A of B isomorphic to the ring of matrices D/*/S of a 
division ring D of finite dimension over the quotient field F of a complete 
local dedekind ring R is hereditary precisely if D and the f? matrix units e;, 
(i, k = 1, 2,..., f) for which 


Cap lap = Opa Caps » Cygh = Ang 
(a, B, α΄, β' = 1, 2,...,f;A ε D) 


te f 
B= > >, Degs ; > x= ly 
a=1 p=1 a=1 


can be transformed by a suitable inner automorphism of B in such a way 
that 


A=A(fi, fas +9 f5 D, R) 


. 1 
=\) Σ Ane & Au € A(D, R) 
i=1k=1 
h+1 


ἘΣ K<i< Dh, Vh<kss 


h 
& A, € J(A(D, Κ))}1 «κε Vf,<i 
j=l 


(s>0;f,>0(1 <i<s); 

ie ΞΕ WS 2 ass 

J(A(D, R)) the Jacobson radical of 
A(D, R) which coincides with the 
only maximal ideal of 

A(D, R)) (Auslander-Goldman). 


For the proof of the first part of Theorem 4 let us point out that every 
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maximal order of A is hereditary. Also, because of the absolute semisimpli- 
city of A the discriminant ideal 


9(A/R) = ideal of R generated by the determin- 
ants det(tr(a;b,)) for all 2n-tuples a,, 
...5 G,, D;,..., 5, of elements of A and 
the reduced system trace function tr 


is not zero and that consequently for all prime ideals 2) of R not dividing 
S(AR) the 9)-localization (A/(R — ))) is a maximal order of A over 
(R/(R — ))). The discriminantal ideal 9(AR) is the power product of finitely 
many distinct prime ideals ),, ..., ἢ), of R. 

(H12) A is hereditary precisely if the finitely many localizations 
(A/(R — 3),}} are hereditary over (R/(R — 3)}}} (i = 1, 2, ..., t) where Q),,..., 
9} run over the distinct prime ideal divisors of 9(A/R). 

Form J yajey(A) = (\i=1 ὑφ (ΔΛ), and its A-idealizer A"). It follows that 
A is hereditary precisely if A = A". If A is not hereditary, then A πα A™. 
Setting ΛΟ Ὁ) = (A) for i = 1, 2, ... it follows that there is a nonnegative 
integer h(A) such that A“) = A* is hereditary, and moreover, 


Ac AM « --- a AAD), 


It is clear that the formation of A* is invariant under the automorphisms of 
A; A** = Σ ΛΈ ἡ (HE) is a crossed product of order A, A* 7 E is 
hereditary. 

Thus the first part of Theorem 4 is established. 

For the proof of the second part, use (H8). We may assume without loss 
of generality that A is simple. 

Using (H6) and (H7) we may assume without loss of generality that the 
dedekind ring R is locally complete. 

If @ is a finite unramified extension of the field F, then the integral 
closure of Καὶ in φ is a dedekind ring R, with ¢ as quotient field such that the 
Jacobson radicals are related by 


J(R,) = J(R)R, - 
For any R-order Q the tensor product ring Q @,» R, is an R,-order with 
Jacobson radical 


7. Θ κι) - JQ)@R,. 


R 


In case Q is hereditary then it follows from (H3) thatQ @, R, is heredi- 
tary, and conversely. 
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Applying a suitable unramified finite field extension to the ground field, 
it may be assumed without loss of generality that A is a ring of matrices of 
finite degree n over F, that the center of E is the algebraic sum of σ 
isomorphic fully ramified extensions of F, and that E itself is a ring of 
matrices of degree m over its center. It follows from the corollary of Theorem 
3 and from (H11), that we may make the following additional assumptions 
without loss of generality: 


ee, Ξ G#/; iJ =1, Σου}; Ye=l, 


= δ᾽ δ μα (1 <j<o) 
a=1! 


© jap Cha’ p’ -- O ix O gar © jap’ 
(l<j<o, 1<k<o; «8, α΄, β' =1,2,..., m), 


m m 


= 5 YY ΣΡ, 


he Ho j=1 αΞ 1 


AA E= ΣΟ XD SeI(RPO MC ag F, 


j=1 a=1 p=1 
δία, 8)=O0 if m,y<a<m,,m,_,<B<m, 
0 ΞΞ γω <m «-"- « πη, ΞΞ ΜΗ, 
δία, B) = 1 otherwise, 
H=H,.QOU(E), Ηρ a finite group, 


1+ Η0 —“°> Aut(E) exact, Ho@ ὦ Inn(E)= Ip, 


Ho @o acting transitively on e,, ..., e,, the primitive idempotents of the 
center of E according to the operational rule 


e(h@o) = yh) (h €H,, l<j< σὶ, 


h~*eagh = j(hA)ap (he Hy, l<j<o; a, 6 = 1,2, 22+; m), 
A= Σ AJ(RYPO(A OE) — (v(h) ε Z for h of Ho), 
he Ho 


v(h'h") = v(h') + v(h") (bh, h” ε Ho). 


But since Ηρ is finite and Z is torsion free, it follows that v(h) = 0 for allh 
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of Ηρ. Hence 
A= )h(A ὦ E). 


he Ho 


There are elements h; of Ho satisfying 
hy'eyh,=e; (lL <i<o), 
so that the elements 
e, = eh; be, hye Gch = 1, 2.1... 5) 
satisfy the rules for matrix units 


CK eign: = One Cin: (i, k, ΗΝ Κ' ΞΞ- 1, Zz δἰ Sie a). 


Furthermore, 
ἐι Ξε (i 1,2,..., σ) 
Hence 
Ax ((A ὦ δι ΗΠ} x Hoy U(A ὦ 6. Η))"σ, 
where 


Ho, = {h|he Ho & e,h= μοι) 


is the H-stabilizer of e, acting on the matrix units. We may assume without 
loss of generality that σ = 1 so that the order A is isomorphic to the tensor 
product ring over R of the hereditary order 


=y x I J(RY Pen, and <A, =3(E)x Ho. 


a=1 p=1 
We apply the 
Lemma _ The tensor product ring of a hereditary order Q, over R with R 


as center and of any hereditary R-order Ὡ, with absolutely semisimple central 
quotient ring is a hereditary order. 


Proof Using (H8) and the invariance under finite unramified extension 
of the field of reference it suffices to consider the case that 


FQ, =F™*™, FQ, = (3EQ)™*™, 


Now the lemma follows from (H11) by explicit computation. 
The application of the lemma reduces the proof of Theorem 4 to the case 
that E is a separable normal finite purely ramified extension of F, and that 


H = H,)(1U(E), Hy @o = Aut(E/F), A=R,g x Ho, A = F"™". 
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In this case the integral closure Κα of R in E is generated over R by an 
Eisenstein element & satisfying the equation 


f(€)= or + Dane =O =(J(R)=2R; λ:έ Καὶ (l<i<n); A,=1). 


where f(t) = t" ἘΣ τς, A,t"~' is an irreducible monic polynomial of R[t]. 
Choosing suitable matrix units of A we may assume that A = F"™", 
A © R"*" and that 


0 1 


0 1 
—m% —-A,.1% °°" --λιπ 


is the accompanying matrix of f so that A is contained in the hereditary 
order A of all matrices («;,) 


(a, ER, a, EnR ifk<i(i,k =1,2,...,n)). 


Any maximal R-order of A containing A is of the form X~‘R"*"X with 
nonsingular matrix X of F"“" such that 
Ae X~'R"*"X, XANAX eR. 


But all irreducible R-representations of A are R-equivalent to the natural 
representation, so that there is also an element Y of GL(n, R) for which 
Y-1AY = XAX~!'. Hence 


YXA(YX)'=A YX ε RA, 
YX =cé" with veZ, ee U(A), X=Y'eé’, 
X~IR™"X = ἔπνεε LY ROY We ev -- E~ YR" MEY D E- AE, 
But by construction we have 
E-1AE=A 
so that indeed ᾿ 
ΧΟ 53... 


Consequently, A is contained in the intersection of all maximal orders con- 
taining A. On the otherhand, we know from Auslander-Goldman that every 
hereditary order is the intersection of the maximal orders to which it be- 
longs. Therefore, the intersection of all maximal R-orders of A containing A 
is the hereditary order A. Thereby Theorem 4 is established. 
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Applications 


I. Let G be a finite group containing a cyclic normal subgroup N that is 
its own centralizer in G and let A be a faithful irreducible rational integral 
representation of degree f. 

It follows that the rational linear combinations of the matrices gA 
(g ε G) form a simple hypercomplex system QGA with Z-order ZGA that is 
the crossed product ZNA x GU(ZNA). Hence the intersection of all maxi- 
mal Z-orders of @GA containing ZGA is a hereditary order ZGA. The inte- 
gral representation of G that are arithmetically equivalent to A fall in as 
many arithmetical equivalence classes as the irreducible integral representa- 
tions of the hereditary order ZGA from which they are derived by restriction 
to G. 

The locally integral representations of local hereditary orders can be 
determined explicitly. 

The integral representations of global hereditary orders can be derived 
from its localizations and a generalization of a famous theorem of E. 
Steinitz. 

II. Let A be a simple hypercomplex system over the rational number 
field 2. Let E a maximal commutative subfield of A that contains the center 
of A as subfield such that E is a normal extension of A. 

Then A is a crossed product of E and Ny,4)(E). There are corresponding 
crossed product Z-orders. 

The intersection of all maximal Z-orders of A containing a crossed pro- 
duct order is a hereditary order. 

Note that the crossed product orders fall into a finite number of equi- 
valence classes under transformations by non zero elements of E. 


6. Valuation Theoretic Interpretation 


In pursuit of H. Benz’s ideas we give here a valuation theoretic interpreta- 
tion of Theorem 4. Given a dedekind ring R with quotient field F, we 
associate with every prime ideal p of R a multiplicative valuation » = 9, 


F > R*°: 9, 
characterized by the rules 
xp > 0x #0 (x € F), (32a) 
(xy)p = (x@)(y@) (x, y € F), (32b) 
(x + γὴφ <max(xg, yp) (χ, ye F), (32c) 
χφεῖ if χεᾷ, (324) 
xp <l if xep. (32e) 
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It follows that φ, is discrete such that 
max x9, = y, =y < 1. (32f) 
xep 


Given a semisimple hypercomplex system A over F we study the Kuerszak 
valuations 


A— R?°: ® 
of A which are characterized as those mappings ® of A in R > 0 for which 
a® > 0a +0, (33a) 
(ab)® < (a®)(b®) (a, b € A), (33b) 
(a+ b)\® < max(a®, b®) ~— (a, be A), (33c) 


and among them in particular the Kuerszak extensions of g, characterized 
by the additional condition 


F|\®=4@,, (33d) 
as a consequence of which we have ¢,-multiplicativity 
(χαὴῷ = (ax)® = (a®)(x®) (ae A, χε F). (33e) 


The trivial case R=F usually is not considered. Furthermore, upon 
proceeding to the g-completion, we may assume without loss of generality 
that R is local with p as its only maximal ideal and complete discrete 
multiplicative valuation @ = g,. Denoting by 


M(®, p) = {alae A ἃ aM < p} (33f) 


the submodule of A formed by all elements of A for which the value of the 
Kuersak valuation ® is not larger than p (0 < peR) we find that the 
Kuersak valuation ® of A is an extension of g precisely if 


pYM(®D, p)= M(®, yp) (ve Z, pe R*°). (33g) 
We note that the multiplicativity 


MP, p)M(e, p2)= MY, p12) (ρι. P2 ε R=O) (33h) 


is a consequence of (32b), whereas for Kuerszak valuations ® of A in general 
only the weaker condition 


MP, p,)M(®, p2) S M(®, p; p2) (P1, P2 Ε R>O) (331) 
holds that is tantamount to (33b). It implies that M(@®, 1) is an R-order of A. 


Definition The Kuerszak extension ® of ῳ to a Kuerszak valuation ® of 
A is said to be hereditary if it satisfies the conditions 


m—— COnMoOo ὦ »» 


ar WM OOO 
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M(D, p1)M(, p2)= M(®, pip2) — (P1, p2 ε R>O), (33)) 


να[α EA ἃ «(Ὁ MM, ρ) « (ἡ ϑι(Φ, p)>a® « ἢ (33Κ) 
p<1l p<il 
As a consequence of these conditions and of (H3) the R-order 
A = M(®, 1) of A is hereditary such that 


J(A)*=pA = (eg € Z>0), (331) 
αᾧ -- γΦ if αεὐυ(Λ)"; ag J(A)"*' (O#aeA) (33m) 


and conversely (331) implies that (33m) defines a hereditary Kuerszak exten- 
sion ® of φ. 

Here the number ἐφ is called ramification index of ® over @ because the 
®-values 4 0 form the cyclic group generated by y'/** which contains the 
value group of @ as a subgroup of index ἐφ. 

Theorem 4 admits the following valuation theoretic interpretation: 

Let A a simple ring of finite dimension over its center F with complete 
discrete multiplicative valuation g such that A is crossed product of some 
semisimple subalgebra E. 

Then there is a hereditary Kuerszak extension ®, of g that is unique up 
to inner automorphism of E. 

For given ®, there is a Kuerszak extension ®, of (Kuerszak extension of 
®,) which is unique up to inner transformation by the units of the center of 
E. 
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